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1. Introduction 



The universality of random matrices has been a central subject since the pioneering work of Wigner [40] . 
Gaudin [57] , Mehta [3D] and Dyson [T2] . The problem can roughly be divided into the bulk universality in the 
interior of the spectrum and the edge universality near the spectral edge. The bulk and edge universalities 
for invariant ensembles have been extensively studied; see e.g. [4j|8l[9l[31] and [Tl[6l[7] for a review. A key 
contributing factor to the progress in the study of invariant ensembles is the existence of explicit formulas for 
the joint density function of the eigenvalues. There is no such explicit formula for non-invariant ensembles, 
and hence our understanding of them is much more limited. The most prominent examples for non-invariant 
ensembles are the Wigner matrices with i.i.d. non-Gaussian matrix elements. The edge universality of 
Wigner matrices can be proved via the moment method and its various generalizations; see e.g. |32fl34j . 
The bulk universality for general classes of Wigner matrices was listed in Mehta's book [30] as Conjectures 
1.2.1 and 1.2.2 on page 7. We shall refer to these two conjectures collectively as the Wigner-Dyson-Gaudin- 
Mehta conjecture, in recognition of the pioneering works of Wigner, Dyson, Gaudin, and Mehta listed above. 
It remained unsolved until very recently. This is mainly due to the fact that all existing methods on local 
eigenvalue statistics depended on explicit formulas, which are not available for Wigner matrices. In a series of 
papers [T5H23] . a new approach to understanding the local eigenvalue statistics was developed. In particular, 
it led to the first proof [TH] of the Wigner-Dyson-Gaudin-Mehta conjecture for Hermitian Wigner matrices 
whose entries have smooth distributions. This approach is based on three basic ingredients: (1) a local 
semicircle law - a precise estimate of the local eigenvalue density down to energy scales containing around 
{logN)^ eigenvalues; (2) the eigenvalue distribution of Gaussian divisible ensembles via an estimate on the 
rate of decay to local equilibrium of the Dyson Brownian motion |12| : (3) a density argument which shows 
that for any probability distribution of the matrix elements there exists a Gaussian divisible distribution such 
that the two associated Wigner ensembles have identical local eigenvalue statistics down to the scale 1/iV. 
Furthermore, the edge universality can also be obtained by some modifications of (1) and (3) [23]. The class 
of ensembles to which this method applies is extremely general; in particular it includes any (generalized) 
Wigner matrices under the sole assumption that the distributions of the matrix elements have a uniform 
subexponential decay. We remark that the universality of Wigner matrices, under certain restrictions on 
the distribution of the matrix entries, was also established in [36li37]. We shall discuss these results in the 
companion paper [13] . 

In this paper and its companion [13], we extend the approach (l)-(3) to cover a class of sparse matrices. 
This class includes the Erdos-Rcnyi matrices, which we now introduce. Symmetric N x N matrices with 0-1 
entries arise naturally as adjacency matrices of graphs on N vertices. Since every non-oriented graph can 
be uniquely characterized by its adjacency matrix, we shall from now talk about matrix ensembles (with 0-1 
entries) and graph ensembles interchangeably. We shall always normalize the matrices so that their spectra 
typically lie in an interval of length of order one. One common random graph ensemble is the Erdos-Renyi 
graph |241I25]. In it each edge is chosen independently and with probability p = p{N). Since each row and 
column of the adjacency matrix has typically pN nonzero entries, it is sparse as long as p 1. We shall 
refer to pN as the sparseness parameter of the matrix. 

Our goal in this paper, and in its companion [13j . is to establish both the bulk and edge universalities for 
the Erdos-Renyi ensemble under the restriction pN ^ N'^/'^. In other words, we prove that the eigenvalue 
gap distributions in the bulk and near the edges are given by those of the Gaussian Orthogonal Ensemble 
(GOE) provided that pN ^ N^^^. We remark that the law of the Erdos-Renyi ensemble is even more 
singular than that of the Bernoulli Wigner matrices, since the matrix elements are highly concentrated 
around 0. Another way of expressing the singular nature of the Erdos-Renyi ensemble is to say that the 
moments of the matrix entries decay much more slowly than in the case of Wigner matrices. 
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The matrix elements of the Erdos-Rcnyi ensemble take on the values and 1. Hence they do not satisfy 
the mean zero condition which typically appears in the random matrix literature. Due to the nonzero mean 
of the entries, the largest eigenvalue of the Erdos-Renyi ensemble is very large and far away from the rest 
of the spectrum, which by our normalization lies in the interval [—2,2]. By the edge universality of the 
Erdos-Renyi ensemble, we therefore mean that the probability distribution of the second largest eigenvalue 
is given by the distribution of the largest eigenvalue of the GOE, which is the well-known Tracy- Widom 
distribution. 

As the first step of the general strategy to establish universality, we shall prove a local semicircle law, 
Theorem l2.9[ stating that the eigenvalue distribution of the Erdos-Renyi ensemble in any spectral window of 
size 7] containing on average Ni] ^ (logN)^ eigenvalues is given by the Wigner semicircle law. Theorem l2.9l 
is valid in the bulk and at the edges as long as the parameter p = p{N) satisfies pN — > oo with a rate at least 
logarithmic in N. Similar results but for much larger spectral windows (of lengths at least rj ^ {pN)~^^^^) 
were recently proved in |38| . 

We note that the semicircle law for Wigner matrices in spectral windows of size 77 ~ N^^/"^ has been 
known for some time The semicircle law in the smallest possible spectral window (of size 77 > iV~^ 

in the bulk) was established in [TnillT]. This estimate, referred to as the local semicircle law, has become a 
fundamental tool in the proofs of the universality of random matrices in jl9j as well as in the subsequent 
works |181I36| . The local semicircle law in [161I17| is optimal in terms of the range of 77, but the error 
estimates, of order (TV 77)"^/^ in the bulk and with a coefficient deteriorating near the spectral edges, were 
not optimal. Optimal error estimates, uniform throughout the entire spectrum and valid for more general 
classes of Wigner matrices, were obtained in f23[ . The local semicircle law proved in this paper can also be 
viewed as a generalization of the results in |23| in two unrelated directions: (a) the laws of the matrix entries 
are much more singular, and (b) the matrix entries have nonzero mean. 

Besides eigenvalues, eigenvectors also play a fundamental role in the theory of random matrices. One 
important motivation for their study is that random matrices can be viewed as mean-field approximations 
of random Schrodinger operators where delocalization of eigenfunctions is a key signature for the metallic or 
conducting phase. Another question about eigenvectors of random graphs is the size of their nodal domains, 
which can studied using delocalization bounds [10]. It was first proved in [T5j that eigenvectors for Wigner 
matrices are completely delocalized, partly motivated by a conjecture of T. Spencer. The method was refined 
in [inillTj , and was also adapted in [3S1|3H] • The key observation behind the proof is that the delocalization 
estimate for eigenvectors follows from the local semicircle law provided that the spectral windows can be 
made sufficiently small. Thus Theorem 12.91 also implies, with v„ denoting the £^-normalized eigenvectors, 
that 



holds with exponentially high probability with some constant C (Theorem 12. 16p . This establishes the com- 
plete delocalization of all eigenvectors as long as the sparseness parameter pN increases at least logarith- 
mically in N. In particular, this result gives the optimal answer to a question posed in Section 3.3 of jlOj. 
asking whether |jva||oo ^ A^^i/2+o(i) holds for all eigenvectors v„. In fact, this question was originally posed 
for fixed p, but our result shows that the bound conjectured in [10| holds even for p ^ (XogN)'^ N^^ . It 
was recently proved in [38] that ||vq||oo ^ {pN)~^^-^ away from the spectral edge; some earlier results were 
obtained in [11] . These results established only the lower bound pN on the localization length; the complete 
delocalization (jl.ip corresponds to the optimal localization length of order N. 

Our main result on the bulk and edge universalities will require a further condition 



c 



max 



(1.1) 



pN > N^/^. 



(1.2) 
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This and related issues will be discussed in the second paper |13) . 



2. Definitions and results 

We begin this section by introducing a class oi N x N sparse random matrices A = A^. Here TV is a large 
parameter. (Throughout the following we shall often refrain from explicitly indicating A'^-dependence.) 

The motivating example is the Erdos-Renyi matrix, or the adjacency matrix of the Erdos-Renyi random 
graph. Its entries are independent (up to the constraint that the matrix be symmetric), and equal to 1 with 
probability p and with probability 1 — p. For our purposes it is convenient to replace p with the new 
parameter q = q{N), defined through p ~ q'^/N. Moreover, we rescale the matrix in such a way that its bulk 
eigenvalues typically lie in an interval of size of order one. 

Thus we are led to the following definition. Let A = (aij) be the symmetric N x N matrix whose entries 
Gij are independent (up to the symmetry constraint = Qji) and each element is distributed according to 



with probability 1 — ^ . 



Here 7 := (1 — q^ /N)~'^/'^ is a scaling introduced for convenience. The parameter q ^ N^/"^ expresses the 
sparseness of the matrix; it may depend on N . Since A typically has q^N nonvanishing entries, we find that 
if g <C N'^/'^ then the matrix is sparse. 

We extract the mean of each matrix entry and write 

A = H + -/q\e){e\, 

where the entries of H (given by hij = — ^q/N) have mean zero, and we defined the vector 

e ^ GAT := -^(l,...,lf . (2.2) 



(As above, we often neglect the subscript of g; the precise value of this subscript will always be clear from 
the context.) Here we use the notation |g)(g| to denote the orthogonal projection onto e, i.e. (|G)(G|)y := 

One readily finds that the matrix elements of H satisfy the moment bounds 



E/i?. = — , E|/i„r ^ — 5-^, (2.3) 
where p ^ 2. 

More generally, we consider the following class of random matrices with non-centred entries characterized 
by two parameters q and /, which may be iV-dependent. The parameter q expresses how singular the 
distribution of hij is; in particular, it expresses the sparseness of A for the special case (|2.ip . The parameter 
/ determines the nonzero expectation value of the matrix elements. 

Throughout the following we shall make use of a (possibly iV-dcpendcnt) quantity = S,n satisfying 

1 + ao ^ e ^ AologlogTV, (2.4) 

for some fixed positive constants ao > and ^0 J? 10. The parameter ^ will be used as an exponent in 
logarithmic corrections as well as probability estimates. 



4 



Definition 2.1 (H). Fix a parameter ^ = satisfying (|2.4p . We consider N x N random matrices 
H ~ (hij) whose entries are real and independent up to the symmetry constraint hij = hji. We assume that 
the elements of H satisfy the moment conditions 



1 fp 

Eh,,=0, E|/.,,f = -, E\h,,f ^ (2.5) 



for 1 ^ i, j ^ N and 3 ^ p ^ (Jog TV)"^" where C is a positive constant. Here q = q{N) satisfies 

(log7V)^« s; g CN^/^ (2.6) 

for some positive constant C . 

Note that the entries of H exhibit a slow decay of moments. The variance is of order N^^, but higher 
moments decay at a rate proportional to inverse powers of q and not N^^"^ . Thus, unlike for Wigner matrices, 
the entries of sparse matrices satisfying Definition 12 . 1 1 do not have a natural scale. (The entries of a Wigner 
matrix live on the scale N~^^'^, which means that the high moments decay at a rate proportional to inverse 
powers of See Remark [23] below for a more precise statement.) 

Definition 2.2 (A). Let H satisfy DeHnition WH Define 

A := F + /|e)(e|, (2.7) 

where f = f{N) is a deterministic number that satisfies 

^ f ^ , (2.8) 

for some constant C > 0. 

Remark 2.3. For definiteness, and bearing the Erdos-Renyi matrix in mind, we restrict ourselves to real 
symmetric matrices satisfying Definition [221 However, our proof applies equally to complex Hermitian sparse 
matrices. 

Remark 2.4. To simplify the presentation, we assume that all matrix elements of H have identical variance 
As in [3T], Section 5, one may however easily generalize this condition and require that the variances 
be bounded by C/N and their column sums (hence also the row sums) be equal to 1. Thus one may for 
instance consider Erdos-Renyi graphs in which a vertex cannot link to itself (i.e. the diagonal elements of A 
vanish). 

Remark 2.5. In particular, we may take to be a Wigner matrix whose entries have subexponential decay, 

F{N^^^\hij\;? x) ^ Cexp(-a:i/^) 
for some positive constants 6 and C. Indeed, in this case we get 



Now we choose 

q := iVi/2(^(log7V)^°'°siogJV^-e^ 

Since q^^ ^ (log7V)'^'°s'°s^7V-i/2, we find that all factors q^^ in error estimates such as and 
below may be replaced with N~^/'^ at the expense of a larger exponent in the preceding logarithmic factors. 
In fact, using Lemma 13.21 below, it is easy to see that in this case all terms depending on q in estimates such 
as p.l6p and p.l7p may dropped, as they are bounded by the other error terms. In particular, Theorem l2.8 
generalizes Theorem 2.1 of [23] . 
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We shall frequently have to deal with events of very high probability, for which the following definition 
is useful. It is characterized by two positive parameters, ^ and i/, where ^ is subject to (j2.4p . 

Definition 2.6 (High probability events). We say that an N -dependent event Q, holds with (^, i/)-high 
probability if 

P(n'=) e-''('°s^)' (2.9) 

/or iV ^ iVo(i/, ao,Ao). 

Similarly, for a given event flo, we say that fl holds with (^, i/)-high probability on if 
forN^ No{v, ao,Ao). 

Remark 2.7. In the following we shall not keep track of the explicit value of v; in fact we allow v to 
decrease from one line to another without introducing a new notation. It will be clear from the proof that 
such reductions of v occur only at a few, finitely many steps. Hence all of our results will hold for u ^ 
where depends only on the constants C in Definition 12.11 and the parameter S in (|2.10p below. (In 
particular, v is independent of ^.) 

We shall use C and c to denote generic positive constants which may only depend on the constants in 
assumptions such as (|2.4p and (j2.5p . Typically, C denotes a large constant and c a small constant. Note 
that the fundamental large parameter of our model is A^, and the notations ^,<cO(-)jo(-) always refer to 
the limit TV — >■ oo. Here a ^ 6 means a = o{h). We write a ^ h for C^^a ^ b ^ Ca. 

We now list our results. We introduce the spectral parameter 

z = E + iri 

where E gM. and rj > 0. Let E ^ 3 be a fixed but arbitrary constant and define the domain 

D := {z eC : \E\ ^T., <ri . (2.10) 
We define the density of the semicircle law 

Qscix) := l-^[A-x^]+, (2.11) 

and, for Imz > 0, its Stieltjes transform 

m,,{z) := / ^^dx. (2.12) 
The Stieltjes transform msdz) = rrisc may also be characterized as the unique solution of 

+ = (2.13) 

z + m.sc 

satisfying \Ttvmsc{z) > for Imz > 0. We define the resolvent 

G{z) {H-zr\ 
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as well as the Stieltjes transform of the empirical eigenvalue density 

m{z) ^TrG(z). 

For a; G R we define the distance to the spectral edge through 

||x|-2|. (2.14) 

Theorem 2.8 (Local semicircle law for H). There are universal constants Ci,C2 > such that the 
following holds. Suppose that H satisfies Definition \2.1\ Moreover, assume that 

^ = ^^^^^^\og\ogN , q^{\ogNf^^. (2.15) 

Then there is a constant v > Q, depending on Aq, Yi, and the constants C in (|2.5p and (|2.6p . such that the 
following holds. 

We have the local semicircle law: the event 

n||m(.)-...(.)| ^(logiV)-^^(min{-,^^,l} + ^)| (2.16) 

holds with (^, v)-high probability. Moreover, we have the following estimate on the individual matrix elements 
of G. The event 



n< max 

z<£D I 



\G,,iz) - <5.,m,,(z)| (logiV)^^« ('1 + + 1 ) k (2.17) 



holds with {(^,v)-high probability. 

The results of Theorem 12.81 may be interpreted as follows. Consider first the bulk, i.e. ke ^ c> Q. Then 
Theorem 12.81 states roughly that 

\m{z) - nisciz)] ^ ^ + ]^' \Gij{z) - 5ij'msc{z)\ < ^ + —^=, (2.18) 

up to logarithmic factors. Since |77isc(2)| ^ 1, both estimates are stable in a sense that they identify the 
leading order terms of m and Ga down to the optimal scale 77 > 7V~^. Note that choosing 77 < in 



Imm(z) = 



allows one to resolve individual eigenvalues Aq of H. Therefore below the scale 77 < N~-^ the quantities tti 
and Gii become strongly fiuctuating and the these fluctuations are larger than the main term. In the regime 
rj ^ (log N)^^N^^ in which the fluctuations are smaller than the main term, a spectral window of size rj 
contains at least (log A^)*-^^ eigenvalues; hence an averaging takes place. 

The factor 1 /q on the right-hand side of the second inequality of p.l8p arises from the strong fluctuations 
of the matrix entries hij, which take on values of size with probability of order q^N^^. Indeed, it is 
apparent from the representations p.l3p and (|3.23p that = ml^hij + . . ., i.e. Gij has a component that 
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fluctuates on the scale q~^. The improvement from to q^^ in the first inequahty of p.lSp arises from an 
averaging in the summation m = iV^^ Gu. If the random variables in the average were independent, one 
would expect the averaging to yield an improvement of order iV~^/^; however in our case there are strong 
dependencies, which result in the more modest gain of order q"^ . 

At the edge [ke = 0), the estimates (|2.16p and (|2.17p may be roughly stated as 

1.1 1^ . ^ ..M < 1 . , 1 



\m{z) - msc{z)\ < - + — , \Gij{z) - SijiTisciz) 



Now we formulate the local semicircle law for the matrix A given in Definition 12.21 Define the quantities 

G{z) := {A~z)-\ m{z) := ^TrG(z). (2.19) 

Theorem 2.9 (Local semicircle law for A). There are universal constants Ci,C2 > such that the 
following holds. Suppose that A satisfies Definition \ 2.'A and that ^ and q satisfy (j2.15p . Then there is a 
constant v > Q - depending on Aq, E, and the constants G in (|2.5p . (j2.6|) . and (j2.8p - such that the following 
holds. 

We have the local semicircle law: the event 



zeD I 



) - mUz)\ ^ (logiV)-^ ( mm^ 1} + ^ ) ^ (2.20) 



holds with (^, v)-high probability. Moreover, we have the following estimate on the individual matrix elements 
of G. If the assumption (j2.8p is strengthened to 

< / Coiv'/^ (2.21) 

for some constant Gq, then the event 



irrax^|G,(z)-^,m..(z)| ^ i^og Nf'^^ + J '^^^^ + ± ] } (2.22) 



holds with (^,i')-high probability, where v also depends on Gq. 

Next, let Ai ^ ••• ^ A^v be the ordered family of eigenvalues of H, and let Ui,...,UAr denote the 
associated eigenvectors. Similarly, we denote the ordered eigenvalues oi A hy fii ^ • • • ^ fi^ and the 
associated eigenvectors by Vi, . . . , vjv- We use the notations Uq = {ua{i))fLi and ~ {va{'i))iLi for the 
vector components. All eigenvectors are ^^-normalized and have real components. 

We state our main result about the local density of states of A. For Ei < E2 define the counting functions 

Afsc{EuE2) N Qsc{x)dx N{Ei,E2) := \{a : Si < /i„ i^a}] • (2.23) 

J El 

Theorem 2.10 (Local density of states). Suppose that A satisfies Definition \2.2\ and that ^ and q 
satisfy (j2.15p . Then there is a constant v > - depending on Aq, T,, and the constants G in (j2.5p . (|2.6p . 
and (j2.8p - as well as a constant C > such that the following holds. 
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For any Ei and E2 satisfying E2 ^ Ei + (iog N)^^N ^ we have 



N{Ei,E2) = Nsc{Ei,E2) 



1 + (logTV) 



N{E2-Ei)y^ q'{E2-Ei: 



(2.24) 



with {^,u)-high probability. 

Away from the spectral edge we have a stronger statement. Fix > 0. Then, for any Ei and E2 
satisfying E2 ^ Ei + (logN)'^^N~'^ as well as ke^ 1^ k* and 5^ we have 



M{Ei,E2) = Afsc{Ei,E2) 



(2.25) 



with (^,v)-high probability, where the constant in 0{-) depends on 



Remark 2.11. Both results (j2.24p and p.25p are special cases of a more general, uniform, estimate; see 
Proposition [ 



In the recent work [38], the asymptotics of the local density of states was also established, but only in 



much larger spectral windows, of size at least (Np) 



-i/ia 



q 



and with a weaker error estimate. 



Our next result concerns the integrated densities of states. 



nsc{E) ^Afsc{~(x^,E), n{E) 



N 



U{-oo,E) 



(2.26) 



Theorem 2.12 (Integrated density of states). Suppose that A satisfies Definition \2.S\ and that ^ and 
q satisfy (|2.15p . Then there is a constant h' > - depending on Aq, T,, and the constants C in (j2.5p . (|2.6p . 
and (12.81) - as well as a constant C > such that the event 



n 



Be[-s,s] 

holds with (^,i')-high probability. 



\n{E) -nUE)\^ {log N)'^^- + -^ 



1 



1 



(2.27) 



Next, we prove that the — 1 first eigenvalues of A are close the their classical locations predicted by 
the semicircle law. Denote by 7q. the classical location of the a-th eigenvalue, defined through 



nscilc, 



a 
N 



for a ~ I, 



(2.28) 



The following theorem compares the locations of the eigenvalues ^i, . . . , p,N-i to their classical locations 
7i, . . . , jN-i- It is well known that the largest eigenvalue fiN of the Erdos-Renyi matrix is much larger than 
7JV- This holds for more general sparse matrices as well; more precisely, if / ^ 1 + c then fijy ~ / + /^^ is 
separated from fiN~i ~ 2 by a gap of order one. The precise behaviour of /iyv in this regime is established 
in Theorem 16.21 below. 



Theorem 2.13 (Eigenvalue locations). Suppose that A satisfies Definition \2.S\ and that^ satisfies (|2.15p . 
Let (j) be an exponent satisfying < ^ 1/2, and set q = N'^ . Then there is a constant ly > - depending on 
Aq, S, and the constants C in (j2.5|) . (j2.6p . and (j2.8p ~ as well as a constant C > such that the following 
holds. 
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We have with (^,i')-high probability that 



N-l 



E Ia^" - ^ i^ogNfi {N'-'^ + . (2.29) 

Q = l 

Moreover, for all a — 1, . . . , N ~ 1 we have with v)-high probability that 

iMa - 7a I ^ (log Nf^ [a-1/3 + i(a ^ (log NfH^ + N'-'^)^] + 7V2/3-4^a-2/3 + ^ 

(2.30) 

where we abbreviated a min{a, iV — a}. 

Remark 2.14. Under the assumption (/) ^ 1/3 the estimate (|2.30p simplifies to 

lMa-7a| ^ i\ogNf^[N-'/'S-'/' + N-'^), (2.31) 
which holds with (^, z^)-high probability. 

Remark 2.15. Theorems 12.101 12.121 and J2.13l also hold - with the same proof - for the matrix H. More 
precisely, Theorem 12.101 holds with with J\f{Ei,E2) replaced with 

N{EuE2) := \{a : E^ <Xo,^E2}\, 

Theorem 12.121 holds with n{E) replaced with 

<E) := l-Af{~^,E), 

and Theorem 12.131 holds with replaced with Aq . 

Our final result shows that the eigenvectors of A are completely delocalized. 

Theorem 2.16 (Complete delocalization of eigenvectors). Suppose that A satisfies Definition \2.S\ 
and (|2.2ip . Then there is a constant v > ~ depending on Aq, S, and the constants C in (|2.5p . (|2.6p . and 
(|2.8p - such that the following statements hold for any ^ satisfying (j2.4p . 
We have with {£^,v)-high probability 

(log nY^ 

max|lv„|U < -^7^- (2-32) 

a<N y/]\f 

Moreover, we have with {£^,v)-high probability 



// additionally f ^ C for some constant C then we have with (^, v)-high probability 

(logAr)4« 

IIvatIIoo ^ ^ . (2.34) 



Finally, there exists positive constants C,Cq such that if f ^ Co(logiV)^ then we have with {£^,v)-high 
probability 

\Wn-b\\oo ^ C^§f. (2.35) 
V iV/ 
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Remark 2.17. If / docs not grow with N, then the components U7v(«) of the largest eigenvector fluctuate, 
and we do not expect (|2.35|) to hold. However, a delocalization bound similar to (|2.34p holds for all /. 
In (|2.34p this bound was proved for / ^ C. In fact, a slight modification of our proof yields complete 
delocalization for the values of / not covered by Theorem 12.161 i.e. 1 ^ / ^ Co(logiV)^. We claim that in 
this case we have, with (^, :^)-high probability, 

||v„|U ^ (2.36) 



The required modifications are sketched at the end of Section 17.31 below. 

Remark 2.18. Similarly, if H satisfies Definition 12. 1[ all of its eigenvectors are delocalized in the sense that 

„ „ / (logiV)4« 



N 

with (^, j^)-high probability. The proof is a straightforward application of p.4p below and the estimate (|7.25p 
applied to Gjj. 

In the recent work [35], a weaker upper bound of size {Np)~^^'^ = q^^ was established for the £°°-norm 
of the eigenvectors of A associated with eigenvalues away from the spectral edge. 



3. The weak local semicircle law for H 

In this section, we introduce and prove a weak version of the local semicircle law for the matrix H . This 
result is weaker than our final result for H, Theorem 12.81 but it will be used as an a priori bound for the 
proof of Theorem 12.81 Moreover, Theorem 13.11 holds under slightly weaker assumptions on ^ than Theorem 
12.81 and is for this reason a more suitable tool for proving eigenvector delocalization. Theorem 12.161 see 
Section [731 for details. 

We shall prove Theorem 13.11 (the weak local semicircle law) for spectral parameters z in the set 

Dl ■■= {z eC : \E\i^j:, {logNfN-^ s$ ?? s$ 3} C Z? , (3.1) 

where the parameter L = L{N) will always satisfy 

L ^ 8^. (3.2) 

Theorem 3.1 (Weak local semicircle law for H). Let H satisfy Definition \2.1l Then there are 
constants ly > and C > such that the following statements hold for any ^ satisfying (j2.4p and L satisfying 



The events 



n|,»,.„(=,Kf.s^ ,3.3, 



n \r f ^ r ,,^ C{\ogN)i , C(logjV)2q 

I I <^ max|G»4z) - m{z)\ + — — S (3.4) 
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hold with {^,i')-high probability. Furthermore, we have the weak local semicircle law: the event 



Pi < |m(z) - msc{z)\ < 
'.eDL [ 



C(logiV)« C(logiV) 



2? 



V9 i^vV^' 



(3.5) 



holds with {^,i')-high probability. 
Roughly, Theorem 13 . 1 1 states that 



|G,, -(5,,m(z)| < - + ^= (3.6) 



and 



\m{z)-mUz)\ < ;^+(]^- (3-7) 



Comparing with the strong local semicircle law, Theorem l2.81 we note that the error bound in p.6p for 
is already optimal in the bulk. However, unlike Theorem 12.81 the quantity Gu is compared to m and not 
rUsc. 

On the other hand, the estimate (|3.7|) is considerably weaker than the corresponding bound in (|2.18p . 
The smaller power 1/3 in the factor {Nvi)~^/^ reflects the instability near the edge; it appears because we 
insist on having uniform bounds up to the edge. If we were interested only in the bulk, it would be easy 
to repeat the proof of Theorem 13. II to obtain {Nr]K)~^^'^ , thus replacing the power 1/3 with 1/2. The price 
would be a coefficient which blows up at the edge. 

As in Thcorcm l2.81 the estimates of Theorem l3 . 1 1 are stable down to the optimal scale rj > N~^, uniformly 
up to the edge. Thus, the difference between Theorems 12 . 81 and 13.11 lies only in the precision of the estimates. 

In order to prove Theorem 13. li we first collect some basic tools and notations. 

3.1. Preliminaries. The following lemma collects some useful properties of nisc defined in (|2.13p . 
Lemma 3.2. For z = _E + ir/ G abbreviate k = ke- Then we have 

\msc{z)\ ~ 1, \l-mscizf\ ~ y^T^. (3.8) 

Moreover, 

lmmsc{z) - 

Here the implicit constants in ^ depend on S in (j2.10p . 



Proof. The proof is an elementary calculation; see Lemma 4.2 in [22]. □ 

In order to streamline notation, we shall often omit the explicit dependence of quantities on the spectral 
parameter z e D^; thus we write for instance Gij{z) = Gij. Define the z-depcndent quantities 

Ao := max|Gy|, A^ := max|Gji - m^cl , A := |m-?7isc|, Vi := Gu-nisc- (3.9) 
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Definition 3.3. LetT C {1, . . . ,N}. Then we define H'-'^^ as the {N - \T\) x {N - \T\) minor of H obtained 
by removing all rows and columns of H indexed by i ^T. Note that we keep the names of indices of H when 
defining H^'^^ . 

More formally, for i e {1, . . . ,N} we define the operation Hi on the probability space by 

MH))ki ■■= lik ^ i)l{l ^ i)hki . (3.10) 
For T C {1, . . . , N} we also write ttt := IligT Then we define 

:= iiMH))^J)^,J^T . 

The quantities G'^^^(z), \^a \ u'aK etc. are defined in the obvious way using H^'^K Here a = 1, . . . , amax; 
where amax ■= N - \T\. 

Moreover, we use the notation 

(T) AT 



i i—1 

and abbreviate (i) = ({*}) '^^ ''^^^^ C^*) = (T U {«}). 
We also set 

^ E G.iP ■ (3.11) 

i 

Note that wc choose the normahzation instead of the more natural {N — |T|)^^ in (|3.1ip : this is 
simply a convenient choice for later apphcations. 

The next lemma collects the main identities of the resolvent matrix elements . Its proof is elementary 
linear algebra; see e.g. pi] . 

Lemma 3.4. For i,j^k we have 



Gkk 

For i j we have 



Gij = G\^' + 5^ . (3.12) 



Gij — —GiiGjj [hij — Zij^ , Gii — {hii — z — Zn) ^ (3.13) 
where we defined, for arbitrary i,j G {1, . . . N}, 

(y) 

Z,, := h,.G('J")h, = J^h.kG'^^phi,. (3.14) 



k.l 



Here hi denotes the vector given by the i-th column of H . Note that in expressions of the form (|3.14p it is 
implied that the i-th and j-th entries of hi and hj have been removed; we do not indicate this explicitly, as 
it is always clear from the context. 

Remark 3.5. Lemma [3^ remains trivially valid for the minors 77^''") of H. For instance p.l2p reads 



^(T) _ ^(T/c) ^ik ^kj 
^ij ^ ^(T) 
'-'kk 



for i,j, k and i,j ^ k. 
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Definition 3.6. We denote by the partial expectation with respect to the variables hi = {hij)^^i, and set 

m,x ■.= x- ¥.,x. 

We abbreviate 



k,l k,l 

The following trivial large deviation estimate provides a bound on the matrix elements of H . 
Lemma 3.7. For C large enough we have with v)-high probability 

\h,j\ ^ -. 

q 

Proof. The claim follows by choosing p ~ v{\ogN)^ in (|2.5p and applying Markov's inequality. 



(3.15) 



□ 



We collect here the large deviation estimates for random variables whose moments decay slowly. Their 
proof is given in Appendix [A] 



Lemma 3.8. (i) Let (a^) be a family of centred and independent random variables satisfying 

CP 



E|a,|f ^ 



(3.16) 



for all2 i^p ^ (log A^)"^"'°s'°sJV^ where a ^ and /3,7 G M. Then there is a v > 0, depending only on 
C in (|3.16p . such that for all ^ satisfying (|2.4p we have with {^,i')-high probability 



^ (logiV)« 



sup i\Ai 



„/3+2a 



1/2- 



(ii) Let ai, . . . , a^r be centred and independent random variables satisfying 

CP 



E|a,|P s$ 



(3.17) 



(3.18) 



for 2 ^ p (log A^)"^" '°siog^ Then there is a v > 0, depending only on C in p.l8p . such that for all 
^ satisfying (|2.4p . and for any Ai G C and Bij G C, we have with {^,i>)-high probability 



N 



i=l 
N 



^ (logTV)^ 



I A; 



1 N ^ 1/2 



i=l 



<; (log7V)« — 



^ (logiV) 



2C 



N2 



1/2 



(3.19) 
(3.20) 
(3.21) 



where af denotes the variance of Oi and we abbreviated 



Ed := max|fiij| , Bo max|Bij| 
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(Hi) Let ai, . . . , and 61, . . . , &Ar be independent random variables, each satisfying (j3.18p . Then there is 
a i> > 0, depending only on C in p.lSp . such that for all ^ satisfying (|2.4p and Bij € C we have with 
v)-high probability 



N 



(log^) 



2? 



„2 



So 



1/2- 



(3.22) 



Remark 3.9. Note that the estimates p.l9p and p.20p are special cases of p.l7p . The right-hand side 
of the large deviation bound (|3.17p consists of two terms, which can be understood as follows. The first 
term gives the large deviation bound for the special case where Ai vanishes for all but one i; in this case 
it is immediate that \Aiai\ ^ (log-/V)^|Ai|q~" with (^, i')-high probability. The second term is equal to the 
variance of Aiai. In particular, p.l7p is optimal (up to factors of logiV). The estimates p.lQp - p.22p 
can be interpreted similarly. (Note that the powers of q in the estimates p.2ip - p.22p are not optimal; this 
is however of no consequence for later applications.) 



For a family Fi 



Fm we introduce the notation 



The following lemma contains the self-consistent resolvent equation on which our proof relies. 
Lemma 3.10. We have the identity 

1 



Gi. 



where 



-z - nisc - (H - Tj) 



Ti hii - Zi + Ai , 



(3.23) 



1 \ ^ GijGji 



G,. 

3 



Proof. The proof is a simple calculation using p.l3p and p.l2p . 



(3.24) 



□ 



3.2. Basic estimates on the event ri(z). 

Definition 3.11. For z e introduce the event 

n(z) := {Ad(z)+Ao(z) ^ (log7V)"«} 

and the control parameter 



' A(z) + Imnisciz) 



(3.25) 



(3.26) 
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Note that '^{z) is a random variable. Moreover, on f2(z) we have 'I'(z) ^ C(logiV) by (|3.2I) . 
Throughout the following we shall make use of the fundamental identity 



= yy M^M) y M^)Mj) ^ y KWP ^ ij^^ (327) 

A similar identity holds for H^'^\ Using the lower bound |msc(z)| ^ c from (|3.8p and the definition p.25p 
we find 

c < \Guiz)\ s; C (3.28) 
on r2(z). Using p.l2p repeatedly we find that on V,{z) we have 

c |g|P(z)| C (3.29) 

for |T| ^ 10 (here 10 ean be replaeed with any fixed number). Similarly, we have on f2(z) that 

max|G,^P(z)| < CA„(z) < C(log^)-« (3.30) 

for |T| s; 10. 

Lemma 3.12. Fixing z = E + it] Cz Dl, we have for any i and T C {1, . . . , N} satisfying i <^ T and |T| 10 
that 



(^-)(z) = ™m(.) + 0(^J-) (3.31) 
holds in fl{z). 

Proof. Wc use p.l2p to write 

m -, (iT) , («T) g-(T)g,(T) ^ (T) ^ (T) (^(T)g(T) 



J_ ^(") ^ J_ ^(T) _ i_ \^ ^ i_ V r^"^^ - — V 

j j j '^ii j j '^ii 



Therefore 



-, (") -, (T) . (T) . (T) , 



N n " V^^' ' / " \Nri 

j j j j 



.(T) 



The claim now follows from p.29p . □ 
Lemma 3.13. For fixed z G Dl we have on ri(z) with [S^^v)-high probability 

K{z) c{^^ + {\ogNf^-^{z)^, (3.32) 

max|Z,(z)| ^/" (log^)^ + (log jV)'^^(z)) . (3.33) 
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Proof. Wc start with (|332l) . Let i ^ j. Using (|XT3)) . ([3^, (|330)) . and (|3l2l) wc get on n{z), with 
i/)-high probabihty, 



|G.,| ^ C{\h,,\ + \Z.,,\) ^ ^ + C 



< ^ + C(log A^)^« ^ + C(log A^)^« (i, 5:|Gif 

^ ^ ^ k,l 



(3.34) 



where the last step follows using p.22p and (j2.6p . Using p.l2p repeatedly and recalling p.29p . we find on 

'"fefe 



n{z) that G^^'^ = Gfcfc + 0{Al). Thus we get on n{z), by (IX^Tll . 



fe,/ 



(y) < Im"^ , 



(3.35) 



Taking the maximum over i 7^ j in p.34p therefore yields, on fl{z) with (^, i^)-high probability, 



A,, ^+o(i)A„ + G(log7V)2«./^^, 

where we used (HH) and the fact that Ntj ^ (logTV)*? by This concludes the proof of ((X^ . 

In order to prove (|3.33p . we write, recalling the definition p.isp . 



(i) 



Using (12201), (Emi), and we therefore get, on n{z) with (^, i^)-high probability, 



\ZA ^ 



9 

similarly to above. Invoking p.32p and recalling p.2p finishes the proof. 

We may now estimate in terms of A. 
Lemma 3.14. Fix z = E + irj £ D^. On i7(z) we have with (^, v)-high probability 

max|G,,(z)-m(z)| ^ c(^^^^p^ + {log NY^^{z)^ . 



/Imm 



□ 



(3.36) 
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Proof. Wc use the resolvent equation (|3.23p . On Q,{z) we have \Ai\ ^ CA^ and \hii\ ^ C/q with i/)-high 
probabihty by Lemma 13.71 Thus, Lemma 13.131 yields on n{z) with t^)-high probability 



|T,| ^ c( ^^^^^^ + (logiV)2«*(z)^ < 1. (3.37) 



From p.23p we therefore get on n{z) with (^,zy)-high probability 

\Gu-G,,\= |G,,||G,,||T,-T,| c(^^i^^^ + (logiV)2«*(z)j . (3.38) 

Since m = Gjj , the proof is complete. □ 

Note that p.36p implies 

Adiz) ^ Aiz) + C + (log 7V)2«M/(z)^ . (3.39) 
on Vl{z) with (^, i/)-high probability. 

3.3. Stability of the self-consistent equation of \v\ on J7(z). Wc now expand the self-consistent equation 
into a form in which the stability of the averaged quantity [v\ may be analysed. Recall the definition 
Vi Gii — rrisc- 



Lemma 3.15. Fix z e D^. Then we have on f2(z) with {£^,i>)-high probability 

A2 \ 

(3.40) 



(l-mL)H = mLbP+mL[Z] + o( (i^S^^ + (logiV)^«+i*^ + ^ 



Proof. Recall that on Jl(z) we have m = o(l). Moreover, (|2.13p and p.8p imply that |msc(2) + z| = 
|msc(2)|~^ ^ c for z G Dl- With p.37p we may therefore expand p.23p on ^}{z) up to second order to get, 
with (^, j/)-high probability, 

iH = mlilv] - T,) + mUiv] - T,)^ + 0(H - T,)^ . (3.41) 
Averaging over i in p.4ip yields, with (^, :^)-high probability, 

(1 - ml)[v] = -ml[T] + mLH' - 2mLM[T] + m^JT^] + 0{[v] + max|T,|)' . 
Recall the definition of Ai. Using ((XT^ and ^j^^ we find on 0(z), with (^,i/)-high probability, 



[T] = ^T.'^.-[Z] + iA] = -iZ] + of + ± ^ + 



2J' 



where in the last step we used p.27p . Moreover, recalling that \[v]\ ~ A, we get by Young's inequality 

-2mLM[T] = o(^^ + (logA^)|[T]P 
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Recalling p.37|) . we therefore have 



^Uv? + +0 (^"S^/^""' + (log jV)4^+ivi/2 , A' 



where we used that on n{z) we have A ^ ^ (logAf)^^ ^ (logAf)^^. □ 
Note that, together with (|3.33p . Lemma [3.151 implies a weak self-consistent equation on [v]: 

(l-mL)H = ml[v? + o(-^)+o(^^^^ + {\ogNr^]. (3.42) 



on 0(z) with (^, ;/)-high probability. Here wc used (|2.6p and (|3.2p . For the proof of the weak semicircle law, 
Theorem 13. 1[ wc shall only use the weaker form p.42p of the self-consistent equation. 



3.4. Initial estimates for large r]. In order to get the continuity argument of Section 13.61 started, we need 
some initial estimates on A^; + Aq for large ry. In other words, we need to prove that fl(E + irj) is an event 
of high probability for 77 ~ 1 . 



Lemma 3.16. Let ?/ ^ 2. Then for z = E + irj e we have 

3giV)^ ^ C(lo^ 

q VN 

with {^,iy)-high probability. 

Proof. Fix z ~ E + irj E Dl with 77 ^ 2. We shall repeatedly make use of the trivial estimates 

|g!?I ^ l^mi Kcl ^ (3.43) 

■' rj rj Tj 

where T C {1, . . . , is arbitrary. These estimates follow immediately from the definitions of G^'"' and rrisc- 
We begin by estimating Aq. For i 7^ j we get, following the calculation in p.34p and recalling (|3.27p . 
with (^, i/)-high probability. 



q \j Nr] q N 

Taking the maximum over i ^ j yields, with (^, z^)-high probability, 

q Vn 

What remains is an estimate on Ajj. We begin by estimating, with (^, ;^)-high probability, 

|T,| ^ ^ + \z,\ + \A\. 
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In order to estimate \Ai\, we observe that p.l3p implies 



Therefore we have, with zy)-high probabihty, 



lAI ^ ^\Gu\ + ^f2\Gfj\\G,,\{}h,,\ + \Z,,[) ^ ^ + CAji+sup|Z,,|) ^, (3.44) 



where we used that with i^)-high probabihty 

|z„| {\ogNf^ 



C ko c 



q VN 



as follows from (j?:^ and (|X?7)) . Similarly, from (|XT5)) and using ([X^ and (j?:^ . we find with z^)-high 
probability 



\ZA < 



C(logiV)« C(logiV) 



2? 



Thus we have proved that with (^, i/)-high probability |Tj| C{logN)^q-'^ + C(log iV)2«Ar- V2, 
Next, using (|2.13p . we write the self-consistent equation p.23p in the form 



(z + rrisc + [v] - Tt){z + rrisc) 
The denominator of p.45p is with (^, i')-high probability larger in absolute value than 

(2-l-0((log7V)«g-i + (log7V)2«7V-i/2)^2 ^ 3/2, 

^ 2 and IHI 1 by ([QS]) . Thus, 

Arf + 0((log iV)«g-i + (log 7V)2«iV-i/2) 



(3.45) 



Since z + m 



3/2 



which yields, after taking the maximum over i, 

Ad + 0{{\ogN)^q-^ + (logiV)2fAr-i/2) 



Ad < 



3/2 



This completes the estimate of A,;, and hence the proof. 



□ 
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3.5. Dichotomy argument for A. The following dichotomy argument serves as the basis for the continuity 
argument of Section 13.61 

We introduce the control parameters 



1 — mi 



V9 



(iV?7)i/3 



(3.46) 



where a = a{z) and /3 ~ depend on the spectral parameter z. For any z e we have the bound 
/3< (logiV)-«. 

From Lemma l3.2l it also follows that there is a constant constant K ^ \, depending only on E, such that 



1 



y/K + rj ^ a{z) < Ky^K + T] 



(3.47) 



for any z G D^. 

We shall fix E and vary rj from 2 down to (logN)^ N^^ . Since + rj is increasing and P{E + ir/) 
decreasing in 77, we find that, for any [/ > 1, the equation ^/k + 77 = 2U'^Kj3{E + ijy) has a unique solution 
ry, which we denote by 77 = rj{U,E) (recall that k = — 2| is independent of rj). Moreover, it is easy to see 
that for any fixed U we have 

^ < 1 . (3.48) 

Lemma 3.17 (Dichotomy). There exists a constant Uq such that, for any fixed U ^ Uq, there exists a 
constant Ci{U), depending only on U , such that the following estimates hold for any z ^ E + irj lE D^: 



A{z) ^ UP{z) or A{z) ^ 



a(z) 
U 



A{z) ^ Ci([/)/3(z) 
on Q{z) with (^,iy)-high probability and for sufficiently large N. 
Proof. Fix z ^ E + iri e Dl. From p.42|) and Lemma [5^ we find 



if 77 > viU,E) 
if 77 < ?^{U,E) 



(3.49) 
(3.50) 



-1 7 

1 — mz„ 



O 



(log7V)« 



with (^, ;/)-high probability. The third term on the right-hand side is bounded by C*(/3A + a/3 + /3^) for some 
constant C* ^ 1. Wc set Uq := 9(C* + 1). We conclude that in il{z) we have with (^, ;/)-high probability 



1 — mi 



V - V 



^ O 



A2 



log 



+ C*(/3A + a/3 + /32). 



(3.51) 



Depending on the size of /3 relative to a, which is determined by z, we shall estimate either [v] or [v]'^ 
using p.5ip . This gives rise to the two cases in Lemma [3. 171 

Case 1: rj ^ rj. From the definition of 77 and C* we find that 



a a 

Recalling that A = \ [v]\, we therefore obtain from p.5ip with (^, zy)-high probability 

aA 



(3.52) 



aA 2A^ + C*(/3A + a/3 + /3^) 2A^ 



2C*a/3, 
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which gives 

Thus, either ak/2 s$ AK^ which imphes A ^ a/S ^ a/U, or aA/2 4C*q;/3 which imphes A 8C*,3 C/;3. 
This proves ^^Ml- 

Case 2: rj < rj. In this case the definition of rj yields a ^ 2U^K^/3. We express — A^ from (|3.5ip and 
we get 

A^ 2aA + 2C*if3A + al3 + /3^) ^ C'pk + C'P^, (3.53) 
for some constant C depending on U . Now p.SOp is an immediate consequence. □ 

3.6. Continuity argument: conclusion of the proof of Theorem 13.11 We complete the proof of Theorem 13.11 
using a continuity argument in ?; to go from r] = 2 down to 77 = iV~^(log A'^)^. We focus first on proving 
p.Sp . We use Lemma [3. 161 for the initial estimate, and the dichotomy in Lemma [3. 171 to propagate a strong 
estimate on A to smaller values of rj. 

Choose a decreasing finite sequence 77^, = 1, 2, . . . , fcp satisfying fcg =^ CN^, — r]k+i \ ^ N^^, rji = 2, 
and r^fcp = 7V~^(log A^)^. We fix i? € [— S, E] and set z^. := E + Throughout this section we fix a [/ Uq 
in Lemma [3.171 and recall the definition of rj{U, E) from Section [5751 

Consider first zi. It is easy to see that, for large enough N, we have 771 ^ rj{U,E), for any E E [— E,E]. 
Therefore Lemmas 13.161 and 13. 171 imply that both il{zi) and 

A(zi) ^ [//3(zi) 

hold with (^, 7^)-high probability. This estimate takes care of the initial point rji. The next lemma extends 
this result to all fc ^ fco- 

Lemma 3.18. Define the event 

nk := n{zk) n {A{zk) c^''\u)Pizk)} , 

where 



Then 



U tfr,k>viU,E) 



P(fi^) ^ 2fce-'^(l°«^)^ (3.54) 

Proof. We proceed by induction on k. The case fc = 1 was just proved. Let us therefore assume that p.54p 
holds for k. We need to estimate 

n^l+^) ^ ^{^k n ^{zk+i) n 0.%+^) + P(i]fc n {n{zk+i)f) + fini) = B + A + n^i) , (3.55) 

where we defined 

A 



B := 



ilk n \^Ad{zk+i) + Ao{zk+i) > (log^)-«|] , 

nk n n{zk+i) n {A{zk+i) > c<''+^'>{u)(3{zk+i)}] . 
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dG,,{z) 



dz 



^ N^^ sup 



-6 



(3.56) 



We begin by estimating A. For any we have 

\Gij{zk+i) - Gij{zk)\ \zk+i-Zk\ sup 
Therefore, by p.32p and (|3.39l) . we have on fife with (^, ;^)-high probability 

Adizk+i) + Ao{zk+i) ^ Arf(2fe) + A„(2fe) + 27V-'^ 5=: g|^ (logJ^)% (iogjv)2gvi/(zfe)^ +A(zfe) 

s; CPizk) « (logiV)-«. 

Thus we find that A e-'HiogN)\ 

Next, we estimate B. Suppose first that rjk ^ ri{U,E). Then, similarly to (|3.56p we find |A(zfe+i) 
A(2fe)| ^ A^~^. Thus we find on fife with (^, z^)-high probability 



R fi 3[/ 

A(zfc+i) < A(zfe)+iV-6 ^ (7/3(zfe)+iV-6 ^ _^(^^^^) 



a(zfc + l) 
U 



(3.57) 



Now the 



Suppose now that rjk+i ^ rj{U,E). Then from p.57p and p.52p we find A(zfe+i) < 
dichotomy of p.49p yields on fife n fi(zfe+i) with (^, i')-high probability that A(zfe+i) ^ U(3{zk+i). On the 
other hand, if J^fc+i < rj{U,E) then p.57p immediately implies A(2fe+i) < Ci{U)f3{zk+i)- This concludes the 
proof of S Q-HiogN)'^ -f ^ ^-j^ 

Finally, suppose that rjk < rj{U,E). Thus we also have < rj{U,E). In this case we immediately get 
from p.50p on fi(zfe+i) with (^, z^)-high probability A(zfe+i) < Ci{U)f3{xk+i)- 

We have therefore proved, for all k, that P(fife.+i) < 2e-''(i°s^)'^ + P(fi^), and the claim follows. □ 

In order to complete the proof of Theorem 13.11 we invoke the following simple lattice argument which 
strengthens the result of Lemma l3.18l to a statement uniform in z € D^. The main ingredient is the Lipschitz 
continuity of the map z i— > Gij{z), with a Lipschitz constant bounded by ri~^ ^ 7V^. 

Corollary 3.19. There is a constant C such that 



U (^w)' 



.zeDL 



U {a(z) > Cl3{z)} 



LzeDL 



< -i.(logAr)« 



(3.58) 



Proof. Take a lattice C C I?l such that \C\ ^ CN^ and for any z e Dl there is a z G £ satisfying 
|z — z| ^ N"'^. From the definition of G it is easy to see that for z, z e Dl 

1 



|G.,(z)-G.,(z)| ^ ij-'lz^Sl ^ 



N 



The same bound holds for |m(z) — m(z)|. Moreover, Lemma [3.181 immediatelv yields 



f][A{~z)^^m} 



(3.59) 



(3.60) 



for some C large enough and some > 0. From p.59p . p.60p . and ^ ^ /3(z) we get 



U {a(z) > C/3(z)} 



zEDl 



The first term of p.58p is estimated similarly. 



□ 
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We have proved (|3.5p . In order to prove p.3p . we note that p.Sp . I|3.32p . and p.58p imply 

with i/)-high probabihty. Now a lattice argument a analogous to Corollarv l3.19l vields p.3p . The diagonal 
estimate p.4p follows similarly using p.36p . This concludes the proof of Theorem 13.11 



4. Proof of Theorem 12.81 

In the previous section we proved Theorem 13.11 which is weaker than the main result Theorem 12.81 (strong 
local semicircle law) , but will be used as an a priori bound in the proof of Theorem 12.81 The key ingredient 
that allows us to strengthen Theorem 13.11 to Theorem 12.81 is the following lemma, which shows that \Z\ , 
the average of the Zi's, is much smaller than of a typical Zi. (Notice that in the proof of Theorem 13.11 to 
arrive at p.42p . [Z] was estimated by the same quantity as each individual Zi.) This lemma is analogous to 
Lemma 5.2 in [U and Corollary 4.2 in [33], but we will present a new proof (in Section 1^75)) . which admits 
sparse matrix entries and effectively tracks the dependence of the exponent p. Our new proof is based on an 
abstract decoupling result. Theorem 15.61 below, which is useful in other contexts as well, such as for proving 
Proposition 17.111 below. 

Lemma 4.1. Recall the notation [Z] = - Zi. Suppose that ^ satisfies (|2.4[) . q ^ (log A^)^^ and that there 
exists D C Dl with L ^ 14^ such that we have with {^,i>)-high probability 

A{z) s; 7(z) for zeD, (4.1) 

where J is a deterministic function satisfying j{z) ^ (iogN)"^. Then we have with 2^ v) -high probability 

\[Z]{z)\ < (logiV)-^ + (A^ + (log7V)^C ^"^--^)+^(-) ) forz^D. (4.2) 

In particular, by (j3.40p . we have with (^ — 2, v)-high probability 

for any value of the spectral parameter z Cz D. 

The proof of Lemma 14.11 is given in Section [5j In this section we use it to prove Theorem 12.81 and to 
derive an estimate on ||£fj| f Lemma 14. 4p . 

The basic idea behind the proof of Theorem 12.81 using Lemma 14.11 is to iterate (|4.2p in order to obtain 
successively better estimates for A. Each step of the iteration improves the power 1 — r of the control 
parameter {q^^ + (iV?;)^^)^^'^. The iteration is started with the weak local semicircle law. Theorem 13.11 
which yields 1 ~ t — 1/3. At each step of the iteration, r is halved at the expense of reducing the parameter 
^ to ^ — 2, thus reducing the probability on which the estimate holds. This iteration procedure is repeated 
an order log log times, which allows us effectively to reach r = 0. 

The iteration step is based on the following lemma, which is entirely deterministic. 
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Lemma 4.2. Let 1 ^ ^ md q > 1. Let < r < 1 and L > 1. Suppose that there is a number 7(2;) 
satisfying 



1 1^1-- 



7(^) ^ (log7V)i9«^ ( ^ + ) for zeDl (4.4) 



such that (|4.ip /lo/ds TOt/i 13 := D^- W^e also assume that 



^ \v\ - \v\ 



A 



2 



< C— + C(log I - + + (log W)*" I far z£Dl, (4^5) 

where a was defined in (j3.46p . Finally, we assume that if rj ^ 1, i/ien 

A(z) < 1. (4.6) 
Then we have ^ 

A{z) ^ (logiV)i9«^ ^ ^^-^^ 

for z G and large enough N . 

Proof. The proof is based on a dichotomy argument. Define 

ao(^) (logAf)(i«+5)«^ + ^ . (4.8) 

We consider two cases. 

Case 1: a ^ fOao- Using the estimate (|4.4p we find 

Now in (|4.5p we may absorb the term A^/logiV into the term on the left-hand side, at the expense of 
a constant 2. Then we complete the square on the left-hand side and take the square root of the resulting 
equation; this yields 

, X I-t/2 , 



A ^ Aa + C {log N)-^^ + +C(logiV)««y— , (4.10) 

where we used (|4.9p . Now (|4.7p follows from (j4.10p . 

Case 2: a ^ lOag. Let us assume that A ^ a/2. Then in (|4.5p the terms [u]^ and A^ can be absorbed into 
the term Q!|[w]|, so that we get 



By the definitions of 7 and ao, we have 
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and the first term in the right-hand side of (|4.1ip is bounded by a/ log N thanks to (j4.8[) . The last term ean 
be estimated similarly. Hence (|4.1ip implies that A ^ a/4 provided that A ^ a/2. 

In other words, if a ^ lOao then either A > a/2 or A ^ a/4. Using the continuity of A(z) and a = a{z) 
in T] = Imz, and the assumption 

A(z) < 1 = 0(a) 

for 77 ~ 1, we get A ^ a/4 on the whole D^. Together with (|4.1ip . we obtain (|4.7p . □ 

Proof of Theorem 12.81 The main work is to prove Theorem 12.81 for spectral parameters z G Dl, where 

L := 120^ (4.13) 

Once this is done, the extension to all z E D is relatively straightforward, and is given at the end of the 
proof. Recall the definition p.26p of 5* and p.25p of n{z). It is clear that if D is replaced everywhere by 
Dl then ((2l7| follows from ((2J6)) . ((3321) and ((336)) . Therefore we only need to prove ((2J6)) . 
We begin by introducing 

^ := 2(loglogA^/log2)+e. (4.14) 

By the assumptions ((^T5|) and (gUS]), we have f < 3^/2 AologlogiV, L ^ 60^, and q ^ {logN)^°^. To 
prove (|2.16p with D replaced by Di, it therefore suffices to establish 



(4.15) 



with (^, i')-high probability. 

The weak local semicircle law, Theorem 13 . 1 1 with ^ replacing ^, yields 



A ^ {logNf Q + ^)^' ^ (logiV)i^« Q + ^) for z G D, (4.16) 

with (^, zy)-high probability. Thus (|4.ip holds with 

7(z) := (log7V)i«^^Q + ^y^'. (4.17) 

With L ^ 60^ and q ^ (logiV)^°^, we also have 7 ^ (logA^)"^. Thus Lemma l4.ll implies that, with ^ 
replacing ^ and D ~ D^, the statement 



1 — m 



2 



A^ ,^.^a7 ( 1 1 „ _s/irlmm„ 



(4.18) 

holds with (^ — 2, ;^)-high probability. This implies ((4.5p . with the choice ^1 = ^ in ((4.5p . since Im m^c ^ Ca. 
Moreover, 7 satisfies (|4.4p with ^2 = ^ and r = 2/3. We also find that A satisfies (|4.6p . since A ^ 7 ^ 
(logTV)"^ (see (|4.16p ). We may therefore apply Lemma 1^1^ with ^1 = ^2 = C to get that 

-/I 1 ^'"'/^ 



A < (logiV)^^^ ( - + ) for zeDl (4.19) 

\9 Nrj; 
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holds with (^ — 2, !^)-high probabiUty. Wc now repeat this process M times, each iteration yielding a stronger 
bound on A which holds with a smaller probability. After M iterations we get that 



A<(i„givrM- + ^ 



2 1 ^ 3 2 ^ 



for z e Dl 



(4.20) 



holds with — 2M, :/)-high probability. 

To clarify the iteration, we spell out the details of the second step. We start from (|4.19p and define 7 as 
the right-hand side of (|4.19p . 



(logiV)-^ ^- + - 
Thus Lemma |4.1[ with ^ — 2 replacing ^, implies that 



1-1/3 



1 ~ mi 



.[v]~[v] 



< C 



A2 



log A'' 



+C(logAr) 



i4(e-2) 



(log^r)4(C-2)Im^.c + 7 
Nrj 



(4.21) 



for z G Dl 



(4.22) 

holds with (^ — 4, i/)-high probability. We now apply Lemma [4.21 with Ci = C ^ 2, ^2 = C a-nd r = 1/3. 
(Similarly, in the fc-th step we set ^1 = l~ 2{k - 1), ^2 = and t = (2/3)(l/2)''"^) This shows that 



A < (logiV)i9« ( - + 



1 1 



q Ni] 



1-1/ 



for z G Dl 



(4.23) 



holds with (^ - 4, zy)-high probability. This is for M = 2. 

Now we return to (|4.20p and choose M :~ [log log N/ log 2] — 1 (where [•] denotes the integer part). Using 
g-i + {Nri)-'^ > cAf-1/2 (by (gH)), we get 



1 1 

q Nrj 



C (logAr)^/^ 



Thus 



A (logAr)f « (- + —] for z e Dl 



(4.24) 



39 tr 1 1 

q Nr] 



holds with (f + 2, i/)-high probability. Recalling p.47p we find that (j4.24p implies (|4.15p . unless 

(4.25) 

Let us therefore assume that (|4.25p holds. Then it remains to prove that with (^, i')-high probability 

for zeDL. (4.26) 



A ^ (logAr)4o«^ + (i^)2oe 1 



Defining 7 as the right-hand side of (|4.24p . we use Lemma |4J1 with ^ -I- 2 replacing ^, to get 



1 — mi 



A 



r/ 1 



< C-— - + C(log7V)i«« {^ + — 



logN 



g2 Ni-j 



(4.27) 
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with ;^)-high probability, where we used (|4.25p and | lmmsc{z)\ ^ Ca(z). We can estimate the term [w]^ 
by and so that 

aA = a\[v]\ ^ C(log7V)39«l + C(log7V)i««-^. (4.28) 

This yields (|4.26p and hence completes the proof of (|2.16p with D replaced with D^. (Recall the simple 
lattice argument of Corollarv l3.19l ) 

What remains is to extend (|2.16p and (|2.17p from zgD]^ to zeD. Let us therefore assume that 



z 



^ E + ir] e D with < 77 77 := (log N)^N ^ . For any i, j = 1, . . . , we get the bound 



\G,,{E + iv)\ 



y-v Ua{i)Uaij) 



^ max 



Aq - z\ 



Xa — Z 

We define the dyadic decomposition of the eigenvalues 

Uo {a : |A„ ^ E\ < r^} , L/fe {a : 2'=-i77 |A„ - E\ < 2^/} (fc ^ 1) . 

This yields 

Next, we break the summation over k into three pieces delimited by fci := max{fc : 2'°?7 < rj} and fc2 := 
max{fc : 2*^77 < 3}. By spectral decomposition, it is easy to see that the function y ^ y lmGii{E + iy) is 
monotone increasing. Therefore we get 

fcl ~ k2 00 ^ 

^ Im Gu {E + 12^) «S E ^ Ii^^ + i'') + E Ii^^ + + E ^ 

fc>0 fe=0 ^ fc=fel + l fc=fc2 + l ^ 

^^i^+C(..-fc.) + C^ 
Nr] 

with (^, i')-high probability, where in the second step we used p.l7p for z G Dl. Therefore we have proved 
that 

max|G,,(i? + ir,)| ^ \ ' 
with (^, i')-high probability. This concludes the proof of Theorem 12.81 □ 



4.1. Estimate of \\H\\. In this section we derive an upper bound on the norm of H. A standard application 
of the moment method yields the following weak bound on Its proof is given in Appendix 1X1 

Lemma 4.3. Suppose that H satisfies Definition \2. 11 that ^ satisfies p.4p . and that q satisfies p.6p . Then 
with {S^,v)-high probability we have 

\\H\\ 2 + {\ogNfq-^''^ . (4.29) 
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Using the local semicircle law, Theorem 12.81 we may prove a much stronger bound on Lemma 14.31 

will be used as an apriori bound in the proof of Lemma 14.41 

Lemma 4.4. Suppose that H satisfies Definition \2.1[ and that ^ and q satisfy (|2.15p . Then with {^,iy)-high 
probability we have 



\\H\\ ^ 2 + (logA^)^« (<7-2+7V-2/3^ _ 



(4.30) 



Proof of Lemma 14.41 We only consider the largest eigenvalue Xn = max^ Aq,; the smallest eigenvalue Ai 
is handled similarly. Set L ~ 120^. Using (j2.16p with ^ + 2 replacing ^, we get with (^ + 2, i/)-high probability 



A(z) (logiV) 



41? 



Then applying, Lemma l4 . 1 1 with 



1 1 
1 1 



:= (log7V)-?(_ + _ 



and 1^ + 2 replacing ^, we have with (^, i')-higli probability 



I — mi 



^ C 



A2 



logiV 



Imm, 



q2 (7V,^)2 



for z e Dl , 



where Ci is a sufficiently large constant. Now if > 2 and k ^ ij, then Lemma [3^ and p.47p yield 



1mm, 



Inserting ()4.34p into (|4.33p . we find with (^, z^)-high probability 

1 - mL r 1 



V - V 



^ C + C(\osNf'^ (—+ ^ + ^ 



Next, for any fixed Ci > 0, wc can find a large enough constant C2 > 2Ci such that if E satisfies 

2 + (log Nf^i (q-'^ + N-^/^'j E ^ i 

then 

min{7V-i/2;,i/4^ N-^n'/^q^, k} ^ [log Nf'^+^ N-\-'^\ 
(Here k = ke = E — 2.) From now on wc assume that E satisfies (|436)) . We define 

r]^7]E ■■= ilogNf'^+^N-^K-~^^\ 
Note that 77 depends on E via k. From (|4.37p wc have 

K ^ r] . 

Using gSZI), fOSl) . and we get 



lmmsc{E + irj) . 



(4.31) 
(4.32) 

(4.33) 

(4.34) 

(4.35) 

(4.36) 
(4.37) 

(4.38) 
(4.39) 

(4.40) 
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Similarly, using (|4.37p we have 

Next, with the lower bound a ^ y^/K from (|3.47p and (|4.39p we find, using (|4.3ip . that 



q Nrj 

with (^, i^)-high probability, where we used (|4.36l) to obtain the first term q^^ on the right-hand side and we 
used Nri^/n ~ (log A^)'^i^+^ (see the definition (|4.38p of rj) for the second term. Now we can assume 

q ^ (logA^)'^-^« (4.43) 

for some large C3 > (otherwise (|4.30l) holds for some constant C by Lemma [4. 3p . We have E + irj G Dl 
(recall that E satisfies (|4.36p V Using (|4.42p . we can neglect the terms and [v]'^ in (|4.35p to get, with 
(5, i/)-high probability. 

Since a ^ K^/k, the last term is bounded by 

^ (iog7V)-c'i«-i^ ^ (iog7V)-c'ie-i 1 



where we have used (|4.40p . The first term on the right-hand side of (|4.44p can be estimated similarly using 
(|4.4ip and (|4.37p . Finally, the middle term on the right hand side of (|4.44p can be estimated by using (|4.42p . 
Putting everything together, we obtain, for any E satisfying (|4.36p . that 

A(z) < for z = £: + i7; e (4.45) 

Nrj 

with (^, i')-high probability. Furthermore, with (j4.34p and (|4.40p . we obtain that for any E in (|4.36p 

ImTO(z) ^ Immsc(z) + A(z) <C — for z^E + u]eDL (4.46) 

Nrj 

with (^, i')-higli probability. Since 

= ^E(X-i)TT^' (4.47) 

we have 

1mm z > 

^ ^ Nr] 

if there is an eigenvalue in [E — ri,E + T]]. Then ()4.47p and (|4.46p imply that, for any E satisfying (|4.36p . there 
is no eigenvalue in [E — r], E + 77] with (^, i^)-high probability. The regime ^ 3 is covered by Lemma 14.31 
This completes the proof. □ 



5. Abstract Decoupling Lemma and Applications 

In this section we prove an abstract decoupling lemma which is independent of the random matrix model. 
We shall apply this abstract result to random matrices in Sections 15.21 15.31 and 17.41 
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5.1. Abstract decoupling lemma. Throughout this section we use the letters A and B to denote abstract 
random variables. Note that A in this context has nothing to do with the matrix A from Definition [521 We 
work on the probability space generated by the N x N random matrices H. Let (A^) be a family of random 
variables indexed by subsets U C {1, . . . , A^}, and denote A := AI®] . For U C S C {1, . . . , iV} we define the 
random variable 

^S,U _ ^(_i)|T|^[(S\U)UT] ^ (-1)I^U[^1 . (5.1) 

TCU V:S\UCVCS 

Lemma 5.1 (Resolution of dependence). For any § we have 

A = ^A^'U. (5.2) 

ucs 

Proof. The proof is a standard inclusion-exclusion argument. □ 
Definition 5.2. Let A := A{H) be a random variable. Then we define the new random variable A^"^"^ through 

A^^\H) := A{7rj{H)), (5.3) 

where ttj was defined in p.lOp . 

Remark 5.3. Note that the operation (.)(^) is compatible with algebraic operations in the sense that 

(A + i?)m = (AS)m = A^i^m. (5.4) 

Since ttu o Try = ttuuv we also have (AC^))^^) = A^^'~'"^l 

Remark 5.4. The matrices i/^^^ and G'-'^^ defined through (|5.3p are x iV matrices. We adopt this 
convention only in this section. This is in contrast to Definition 13. 3[ where the same notation was used for 
the (A''— |T|) X {N — |T|) minors of the same matrices. This slight abuse of notation will not cause ambiguity, 
however, because we shall only consider matrix elements H^j and Gl^ for i, j ^ T; for these matrix elements 
the two definitions coincide. 

Definition 5.5. We say that a random variable A is independent of the set U C {1, . ■ . ,N} if A = A^^^ 
(or, equivalently, if A is independent of the family (hij : i e U or j e U)J. 

We shortly explain the idea behind these definitions. In many applications we choose :— so 
that is independent of U. In this case, the decomposition (|5.2p can be interpreted as follows. We first 
fix a reference set S. From (|5.ip it is clear that A^'^ is independent of S\ U, i.e. it depends only on the set U 
(among the variables in §). Therefore (|5.2p can be viewed as a resolution of dependence of A on subsets of §. 
We shall see that when we apply this decomposition to resolvent matrix elements, i.e. set A = Gij, then G^^"^ 
will be comparable in size with a product of at least |U| + 1 off-diagonal resolvent matrix elements, which are 
small with high probability. Hence, in this case, the decomposition (|5.2p is effectively a graded resolution 
with a trade-off between dependence and size. A larger U means that G^j*^ is smaller, but it depends on 
more variables. For smaller U's we will exploit that G^'^ is independent of more variables. 

The purpose of this graded decoupling is to obtain large deviation estimates on the average [Z] :— Zi 
of N weakly dependent centred random variables Zi. The precise result is given in Theorem l5.6l below. Before 
stating it, we outline the main ideas. 
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In our applications, the covarianccs between different variables Zi arc too large to be controlled in terms 
of their variances, and hence standard methods for sums of weakly dependent random variables relying on 
such ideas do not apply. Instead, the weak dependence will be expressed in terms of the smallness of 2^''^ for 
large U; the size of zf '"^ reflects how strongly Zi depends on the set U. The basic strategy is a high-moment 
estimate 

ii ,...,ip 

on some high-probability event S, whereby each term Zi. is expanded according to the graded expansion 
of (|5.2p . The right-hand side is controlled using the two following facts, (i) zf''^ is small for large U (weak 
dependence of Zi on U). (ii) The expectation vanishes if all factors are independent. Note that this graded 
expansion differs from the conventional martingale-type arguments used to establish central limit theorems 
for correlated random variables. 

The basic idea of a graded expansion to control large deviations of sums of weakly dependent random 
variables was introduced in Lemma 5.2 of [22] in context of Wigner matrices. This result considers the special 
case Zi = Zi (as defined in p.l5|) ) and uses expansions in full rows and columns to detect dependencies. For 
the applications in [22], only large but A^-independent powers p were considered. Hence in [22| it was not 
necessary to keep track of the p-dependence or the probability of S. 

A new proof was given in Lemma 4.1 of |23j . where the p-dependence and the probability of S were 
tracked precisely. This proof relied on an expansion in terms of invidual matrix elements and not full rows 
and columns. Thus the expansion was more economical but its combinatorial structure was considerably 
more involved. 

In this paper we present an abstract generalization of the row and column expansion method of |22j . It is 
formulated for an arbitrary family of random variables Zi, . . . , Zj^ . As input, it needs bounds on the terms 
of the graded expansion of Zi. The abstract formulation thus streamlines the argument by dissociating two 
unrelated steps of the proof: (i) the moment estimate using the graded expansion (a probabilistic estimate 
given in Theorem l5.6p and (ii) controlling the size of the graded terms for a concrete application (in the case 
of resolvent matrix elements, a deterministic, almost entirely algebraic, argument given in Section [5. 2p . 

For our purposes, this increased generality is needed for two reasons. First, it allows for an efficient 
control of the strong fluctuations associated with sparse matrix entries. Second, we use it to control the 
average of not only Zi (Lemma I5.13P but also quantities like ()7.27|) with a different algebraic structure. In 
the special case Zi = Zi and q = N^l"^ (Wigner matrix), our result reduces to that of Lemma 4.1 in [23] . 

Theorem 5.6 (Abstract decoupling lemma). Let Zi, . . . jZ^f he random variables and recall the nota- 
tion 

i=l 

Let S be an event and p an even integer. Suppose that there exists a family of random variables (z|'^')i_u 

indexed by i Cz {1,...,A^} and U C {1,...,A^} satisfying i ^ V, such that z]"' = Zi and the following 
assumptions hold with some constant C . 

(i) Recall the partial expectation from Definition ] 3. (A For i ^ U we have that zf^ is independent ofV 
and 

E,zf = 0. (5.5) 
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(ii) (U' -norm in ^j. For any U, § with U C § and i ^ S we consider Zf'"^ defined by (jS.ip from the family 
zf^^ . Then for any numbers r ^ p with |S| ^ p we have 

E(l(S)|ZPr') s; {Y{CXu)y , withu:=\lJ\ + l, (5.6) 

where X and Y are deterministic and X satisfies 

(Hi) (Rough bound on the L^-norm in Define 

[S], (7r-io7rO(S). (5.8) 
For any V, S satisfying U C §, i ^ §, and |§| ^ p we have 

E(l([S],)|zf"|2) N^P. (5.9) 

(iv) (Rough bound on Zi). For any U we have 

l(S)|zfl| < YN^ . (5.10) 

(v) (H has high probability) . We require that 

Then, under the assumptions (i) - (v), we have 

p(l(5)|[Z]| ^/2y(x2 + 7V-i)) <; ^ (5.12) 

for some C > and sufficiently large N . The constant in (|5.12p depends on the constants in (j5.6p . (|5.10p 
and ([5lT|) . 

The key assumptions in Theorem 15.61 are (i) and (ii); the key (smaU) parameter is X. Assumption (i) 
simply ensures that all terms of the graded expansion of Zi have zero expectation. Assumption (ii) defines 
the decay of Z^'"^ in the size of U; roughly, it states that 

l^-s.Uj < j^|U|+l 

in the sense of high moments. This is in accordance with the principle outlined above that terms of the 
graded expansion which depend on many variables have a small size, while those which are independent of 
many variables may be larger. The parameter Y is trivial in our applications, where we shall take it to be a 
logarithmic factor. In Lemma 4.1 of [25, the role of X was played by the parameter defined in p.26p . 
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Proof of Theorem 15.61 We find 



JV 



(5.13) 



Ql .0.2 , ■ . ..Qp — 1 



i=i 



where # stands for either nothing or complex conjugation. Let ol ~ (ai, . . . ,ap) and define § S{ol) '■- 
{ai, a2, • ■ • , Qfp}. Then we have 



E(1(S)|ZP) s$ N-PpPy^ N' max 

^ a:|S(a)| = 



Abbreviating := § \ {a_,} we find from (|5.2p that 



(5.14) 



(5.15) 



Thus ((5T4)) implies 



E(1(S)|Z|P) ^ N^PpPy^N' max 

^ a:|S(a)|=,s 



e|i(s) e ••■ e nt^-^'j'-''^- 



(5.16) 



(abbreviating A^'^ = ]4^^"). Writing := U {a^} we have 



E(l(S)|Zr) (^)'^^iV^s"nPmax 

s— 1 n— 1 



Now we claim that 



N's'^nP max 



|§(a)|=s,U;. C§,,^|U,| 



for some C > 0. Then inserting (j5.18p into (|5.17p . we find 

E(i(s)|z|J') 5$ (Cp"y(x2 + iv-i))^ , 

which implies (|5.12p by Markov's inequality. 

It only remains to prove (|5.18p . We consider two cases: ti ^ 2s and ?i ^ 2s — 1. 
We begin by proving (|5.18p for the case n ^ 2s. Using Holder's inequality we find 



(a)| = s,U;. C§,,f]|U,|=n|. 

(5.17) 

s$ {CNp^^YiX^ + N-^)Y 
(5.18) 



(5.19) 



i/p 



\{K[m\[z*p-^'. 



(5.20) 



Applying (|5.6p to the right-hand side, we obtain that 



He (l(S) YP{CnXr 



(5.21) 
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since J^j + l) = J2j — Combining (|5.20p . (|5.2ip and the factor n ^ p, we have bounded the 
left-hand side of (|5.18p as follows, 



c|E[l(S)n[Z#f-"A : |s(«)|=s,u;.c§,,f]|u;.| 

^ ^ .7 = 1 .7 = 1 



^ N''YP{Cn^Xsf'' 
^ (CiVpi°r(x2 + Ar-i))P^ 



where in the second inequality we used 



Cn^Xs ^ CXsV CXp" < 1 

(see (IS-Zp ) and n ^ 2s, and in the third inequality s ^ p and n ^ sp. This completes the proof of (|5.18p for 
the case n ^ 2s. 

Now we prove (|5.18p for the case n ^ 2s — 1. Fix sets U^- with = n, where we recall that 

Vj :~ V'j U {oij} and |lUj| = |U^| + 1. By definition of Uj, we have aj G Vj for all j. Since n ^ 2s — 1, we 
therefore find that there exists a k such that ak G Vk and ak ^ for j ^ k. By the definition (|5.ip and 
([531) . is independent of Sj \ V'j, i.e. of § \ Vj. We conclude that 



is independent of {ak}. Therefore, 
Thus, 







)-( 









i=i 



l([S]„JE„jZ#f-"'^ 



-E(l([SU,\S)n[2*f-' 
j=i 



which yields 



€ E|l([S]„,\S)n|[Z#f-"^- 
j=i 



E(1([SU\S) 



Since (|5.22p is independent of ak , we get 



j^k 

















i(2)n|[2t.f-"^- 


oo 




OO 





(5.22) 



(5.23) 



(5.24) 



(5.25) 



(5.26) 
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Using the definition of zf'^ in (|5.ip and (|5.10p , we have 



and 



where we used s ^ p and n ^ 2s in the last inequahty. Combining (|5.25p . (|5.26p . and (|5.28p . we get 



(5.27) 
(5.28) 



E(i(s)n[2f,f-"^ 

i=i 



<; (yArC)P-i22p|^iEi([s]„^\S)|[Z#f-"'^ 
Applying Schwarz's inequality on the right-hand side, we find 

1/2 



(5.29) 



E|i(s)n[2#f-"^ 

i=i 



(y7VC)P-i22p(p([S]„^ \S)) (El([S]„J|[Z#f-"'^ 
Using ((5?7| . ([ETT|) . ((O)) . and that n < 2s - 1 ^ 2p, we get, for any C > 

(y7v'^)j'22pp(s^)i/2 ^ yj'iv-^p. 

Since s ^ p, the proof of (j5.18p in the case n ^ 2s — 1 is complete. 



1/2 



(5.30) 



E|i(s)n[^f,f-''^- 



(5.31) 
□ 



5.2. Decomposition of Gij. In order to apply Theorem 15.61 to estimate \Z\, we need to derive bounds, and 
hence formulas, for the decomposition G^^-^ of resolvent matrix elements Gij. As usual, G refers to the 
resolvent of at a fixed spectral parameter z (which is suppressed in the notation), i.e. G = G{H) is viewed 
as a function of H . The main result of this section is the bound (|5.73p below. 
Note that the results in this subsection are entirely deterministic. 

Lemma 5.7. Let z ~ E + irj Cz D where D d C is some compact domain. Let U C {1, 2, . . . , N} and 

1 



|U| =^ 



(Ao + Ad)logAr 



Then for any i,j ^ U, we have 



|gJ' - TO,, S,, I C{l{t = j)Ad + A< 



In particular, if A^ + Aq (log N) ^ , then 



(5.32) 



(5.33) 



(5.34) 



Here the constants c and C depend only on D. 
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Proof. Define 



V, |V| =m} . (5.35) 



In the case m = 0, (|5.33l) follows from the definition of and A^. The estimate (j5.34p follows from (|5.33p 
noting that |msc(2:)| J5 c on a compact domain z Cz D with c depending on D. Next, from p.l2p we get 

^(T)^(T) 

G\f^ = G - ''^ , where i,j ^ {k}UT and k . (5.36) 
^kk 

Assuming (|5.34p for |1LJ| = m, we therefore obtain 

Bm+i s$ Brn + CoBl (5.37) 
for some constant Cq > independent of m. This implies that 

m 

B,n+i ^ CoY,Bl + Bo. (5.38) 

k=0 

By induction on m one obtains Bm ^ 2i?o as long as ComBQ ^1/2. □ 

In order to state the next result, we introduce a class of rational functions in resolvent matrix elements. 
Fix two sets U C § satisfying U 7^ 0. For fixed 71 G N let the following be given: 

(i) a sequence of integers (ir)"=i satisfying ik 7^ ik+i for 1 ^ ^ n; 

(ii) a collection of sets {Va)a=i satisfying Iq, ia+i ^ Uq as well as § \ U C Uq C § for 1 ^ a ^ n; 

(iii) a collection of sets (T^)J5=2 satisfying ^ as well as § \ U C C S for 2 ^ /? ^ ?i. 
Then we define the random variable, parametrized by (ir)"=i , (Uq)^^^' i'^p)^=2' 



F 

where 



((v)"ii\(U.)S=i,(T^)^.2) := (5.39) 



^ - n > Q = n 



a = l 13=2 



Note that F depends on the randomness via the resolvent matrix elements. All matrix elements are off- 
diagonal in the numerator and diagonal in the denominator; n counts the number of off-diagonal elements 
in the numerator. The sequence of indices of these matrix elements is consecutive if P/Q is written as an 
alternating product of off-diagonal elements from the numerator P and reciprocals of diagonal elements from 
the denominator Q, i.e. in the form 

Definition 5.8. for U C {1, . . . , N} and ij (^V define := Gf^\ For i,j (^E andV CS define Cfj" 
through (j5.ip . 
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Lemma 5.9. LetE> C {1, . . . , A^} and i, j ^ S. Then 



G^f = Gif . (5.41) 



// 7^ U C § then G^j can be written as 



2|U| K„ 

= E ^« = E^"'^' (5.42) 

n=|U|+l k=l 

where Y^'^=\v\+i -^"^n ^ 4l'^l|U|! and each F^^k is of the form (j5.39p (with a possible minus sign), with 
12,..., In € U, ii = i, in+i ~ j, and with some appropriately chosen sets {Va)a=i '"^'^ ('^/3)/3=2 which 
may be different for each Fn^k- 

Note: The index n in Fn and Fn^k refers to the number of off-diagonal elements appearing in the rational 
functions (|5.39p . while k is just a counting index. 

Proof. First, (|5.4ip follows from ([5TT|) . 

It remains to prove (|5.42p in the case U 7^ 0. Using Definition 15.81 and Remark |5.3[ one readily sees that, 
for a set T satisfying T n U = and i,j ^ § U T, we have 

Thus, if a G U C S, we get from ((5?T|) and ((5^ that 

G^'^ = ,^S\{'^}.U\{a} _ ^S,U\{a} ^ gS\{a} , U\{a} _ ^^S\{a} , U\{a} ^ i j^S (5 44) 

In the special case |1LJ| = 1, writing U = {a}, we have 

G^u = G^^'^^ = Gif^"»-G(f . (5.45) 

''J ''J ''J '■J ^ ' 

Using ((XT^ . we obtain ([O^ for the case |U| = 1, i.e., 

MS\{a})MS\{a}) 

For a general set U with |U| ^ 2, using (|5.44p we can write G^'^ iteratively as F — i^*^"^ where F itself is 
of the form E — E^''^ for some appropriate E. For example, for a, 6 g U we have 



a},V\{a} _ ^^S\{a},V\{a}^ 



(a) 



_ gS\{a&},U\{ab} _ |^gS\{a&},U\{ab}y^^ _ ^^S\{ab}, U\{a6} _ |^^S\{af)} , U\{a6} ^ "^^^ ^ 

RecaU = Fo tt^ from Definition [521 Then to prove (|5.42p in the case U with |U| 2, we use induction 
on |1LJ|. The key step is Lemma [5. 101 below, which contains the required properties of F — F^°'\ Its proof will 
be given later. 
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Lemma 5.10. Let F be of the form (|5.39p . We assume that 



U ) U ( U 

■13=2 



< 



If 



S i {ll,i2,i3,---,«n+l}U ( U U« ) U f IJ 



(Ao + Ad)logA^ 



a=l 



(5.47) 



(5.48) 



then F — F^^^ is equal to the sum (with signs ±j of 2n ~ 1 terms of the form (j5.39p . 

n 71—1 

F-F^^^ = E^'i(^^'^)-^^(u^'JStl,(T^^,)^^^^ (5.49) 
1=1 1=1 

where the new arguments, carrying a tilde, satisfy the following relations: 



= n + 1 and = n + 2 . 

(ii) For 1 ^ ? ^ n, the family [if'^.) is given by 

*Z,2' *Z,3' ■ • ■ ' *;,n+2) '= («1 J *2 S, ) . 

For 1 ^ i ^ ?? — 1, the family (if^) is given by 

«f2' *f3' • ■ • > «fri+3) ■= («1, «2, ■ • ■ , ihil + l,S, ■i;+2, . . . , in+l) ■ 

(lii) All sets U^*„, T^^^, Uf„, and Tf^ appearing in (|5.49p are subsets of 



|JuJu|jT^)u{5}. 



(5.50) 



(5.51) 



(5.52) 



(5.53) 



Now we return to complete the proof for Lemma [5791 Using (|5.44[) . we get for s G U and i,j^E> that 



Using induction on |U| and applying the decomposition (|5.42p to G 



S\{s}.V\{s} 



, we get 



g: 



;\{4,u\{s} 



2|U|-2 



K 

Fn — ^ ^ Fii }^ , 

fc=l 



(5.54) 



(5.55) 



where J2n=\v^ ^^n ^ ^1^1 ^(|U| — 1)! and each Fn,k is of the form (|5.39p (with a possible minus sign) with 

^2i ■ • ■ ; 

G U\ {s}, ii = i, i„+i = j, and with some appropriately chosen sets (Uq)"^i, {^13)^^2 satisfying 
S\UcUa, T0C§\{s}, Isja^n, 2s;^s;n. 
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Now from (|5.54p we get 

2|U|-2 K'^ 

= E Y.{Fr.>^-iFnJ'') . (5.56) 

n=|U| k=l 

Moreover, using (|5.49p we get 

n n— 1 

Fn,, - (F„,,)(^^ - E + ^ ^^„^,_, . (5.57) 

where each F^fj^ ^ and F^j. ^ is of the form (|5.39p (with a possible minus sign) with ii = i, i„i+i = j , where 
m = n + 2 for F^f/, j and m ~ n + 3 for F^f j. j, and the other indices belong to U. Here the sets {tS a)^=i and 
(f^)^^2 satisfy 

Furthermore, with (|5.50l) . the number of off-diagonal elements in the numerators of F^i^ ^ and F^ ^, ^ are n + 1 
and n + 2, respectively. Hence, together with (|5.56p . we obtain 

2|U|-2 K'^ / „ n-1 \ 



-.1 

k=l \l=l 1 = 1 



With the assumption of Y.T}\v\ ^'n =^ 4l"l-i(|U| - 1)! for the summation bounds in (jS^S]) . we know that 
G^f can be written in the form (|5.42p with X]^=|'u|+i ^ 4l^lHJ|!. This completes the proof of Lemma 



[531 □ 

Proof of Lemma 15.101 Using p.l2p it is easy to derive the following two identities for s ^ U: 

^(U)^(U) 

Gf = Gif ) + , for J ^ U U {.} , (5.58) 

= + m f nif ^ for fc^UU{s}. (5.59) 

Now (|5.58p implies that Lemma [5. 101 holds in the case n = 1. We shall first prove it for the case n = 2, and 
then give the proof of the general case. If n = 2 then by assumption F has the form 

^(U)^(V) 

with some sets U, V, T, and indices i, j, k. For s^UUVUTU {ijk} we get from (|5.58p that 

^(Us)^(V) ^(U)^(U)^(V) 
_ ^ij ^'jk '-'ts ^sj ^Jk . 
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Next, using (j5.59p on the first term, we obtain 



^{Vs)MV) ^(Us)^(T)^(T)^(V) ^(U)^(U)^(V) 
_ ^jk '^tj '^js <^sj S'fc '^ts <^sj '^jk ,^ 

^(Ts) ^ ^{Ts)^(T)^(T) ^(U)^(T) ■ 

Using (|5.58p again on the first term, we have 

^(Us)^(V)^(V) ^(Us)^(T)^(T)^(V) ^(U)^(U)^(V) 
^ ^ ^ ^(Ts)^(V) ^ ^(Ts)^(T)^(T) ^ ^(U)^(T) ' ^"''"'^^ 

One can easily check that the last three terms are of the form (|5.39p . and the indices satisfy (|5.50p - (|5.53p . 
This completes the proof for Lemma [5.101 in the case n = 2. 

Now we consider the case of a general n. Inserting (|5.58p and (|5.59p into the each term in (|5.39p . we have 



F 



{{^ry:±l{V^)l^„{Tf>r^^,) = ^, (5.64) 



where 



and 



^(Uc)^(U„) 



^ = n G5i.. = n ^5:1. + ^-^^ 



a=l 



n Ks 



13=2 13=2 \^if,,2f, "-^ifii/j "-^i/ji/j 



On the other hand, 



where 



{F(^{tr):+l{lJ^yUA'^p)p=2)f^ = (5.65) 



n 

p'-' = n ^5:1. (g(^))- = n 

Q = l /3=2 

For m S N, we write, using (|5.58p and (|5.59p . 



g!""!^, = A2™-i+B2™-i (5.66) 



where 



Similarly, we write 



i2m-l 



g51)^^ and i?2™-r := '"-^t,:^"'^- ■ (5.67) 



where 



{GZrJ!L^) ' = A2,n + B2ra, (5.68) 

:= {g[Ii+J^^^) and := (T,„+,)''"ti(T„ ")7^(t„+i.) ' (5-69) 
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Then 

2TI-1 2n-l 
m— 1 m— 1 

To complete the proof, we use the identity 

2n-l 2n-l 2ri-l /m-1 \ / 2n- 

n + - n ^™ E n n (^^- + ^^■) i ■ (^-^o) 

771—1 771—1 777—1 V J — 1 / \ J— 777+1 

It is easy to check that, for any term of the form 

( 277-1 
n (^i+B,) 

in the sum (jSJO)) . the desired properties ([09)) - ([533)) hold. □ 

We may now easily obtain the following bound on G^j*^. 
Lemma 5.11. Lei U C § C {1, 2, . . . , iV} and 

|G|;®-m,,Jy| c(l{i^3)Kd + K) ■ (5.72) 
// in addition U 7^ and i,j ^ S, then 

^ (G|U|Ao)l"l+^ (5.73) 



<^7J 



and 

|(1/G,0^-"| ^ (G|U|Ao)l"l+i . (5.74) 

Proof. The estimate (|5.72|) follows (|5.4ip and ()5.33p . In order to prove ()5.73p . we apply Lemma [5Jl to each 
G^f, and get 

2|U| K„ 

= E ^" = J2^r^,k, (5.75) 

77=|U| + 1 fc=l 

where J2n=\v\+i ^ 4l'^l|U|!. Here each Fn^k is of the form (|5.39p (with a possible minus sign), where 
n counts the number of off-diagonal elements in the numerator; the indices satisfy 12, . . . , i„ £ U, ii = i, 
in+i = j- Note that the factors P in (|5.39p are the product of off-diagonal terms and the factors Q the 
product of diagonal terms. Applying (|5.33p and (|5.34p on the off-diagonal and diagonal terms in P and Q, 
we get 

i^77,;= ^ ^^T^ ^ (GA,)". (5.76) 

^2|0| 



Together with E„ " u|+i^^" ^ 4l"l|U|!, this implies (15751) . 
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In order to prove (|5.74p . we observe that, similarly to Lemma [5.91 we have 

|(.(T)|^l"l + l 



maxfe jdf TcsK-rti , , 

i(i/G.n ^ ■ Z.'m^^ ^'-''^ 



( ■ lri(T) 

I ^:l^lj^T,Tcs|'-^jj 



providcd that 

Hence (|5.74p follows. □ 



l^(T)| ^ ■ |^(T) 

max GL ^ mm G,-,,- 

fcj'^T,TcS' ' J^T,TCS' 



5.3. Proof of Lemma 14.11 Observe first that (|4.3p follows immediately from (14. 2p and Lemma 13.151 It 
therefore remains to prove (j4.2p . 

We define the event S by requiring that on it (|4.ip and the following two events hold: 



(i) For every z e _D we have 



Ao(z) < GQ + (logiV)2«vi/(z)j <; G|^i + (logiVf«^ 



'lmmsc(z) + 7(2:) 



Nri 



(5.78) 



(ii) For every z € D wc have 

max|G,,(z)-m(z)| ^ Alog^)^ +{logNf^^iz) 



^ C (^^^ + (logiV)^y I^^^^^^41±^j . (5.79) 

Now Theorem l3.11 Lemmas 13. 131 and 13. 141 as well as (|4.ip and D C Dl imply that S holds with (^ — 1/2, i/)- 
high probability. Note that here we reduced the ^ to ^ — 1/2 to account for the intersection of three events 
of (^, i^)-high probability. It is crucial that ly remain constant in this step, as in some applications, such as 
Theorem 12.81 it is interated. 
We write as 

= T.(f'''^-N)G^kl+T.^'^'^Gk}hu. (5.80) 

fe ^ ^ k^l 

Lemma |4J] follows from the next two lemmas. As before, we shall consistently omit the spectral parameter 
z £ D from the notation in the following arguments. 

Lemma 5.12. On S we have, with {^,iy)-high probability, 
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Lemma 5.13. On ^ we have, with — 2^v)-high probability, 



^ (logiV) 



14? 



Proof of Lemma 15.121 We split the sum inside the absolute value on left-hand side of (|5.8ip as 



m'*^ — m I + 



i^k ^ 



(5.82) 



(5.83) 



In order to estimate the first term of (|5.83p . we use the estimate p.l7p (with (/i.^^. — N ^) playing the role 
of a,;, and setting Ai = N^^, a = 2, /3 = —2, and 7 = 1) to get, with (^, z^)-high probability, 



i^k ^ 



^ (logiV)^-^ 



(5.84) 



Therefore 



C(log7V)^ — 



(5.85) 



Similarly, in order to estimate the second term of ()5.83p . we fix i and sum over k, which yields, with 
(^, i/)-high probability. 



max 



E ( ''^fc " N 



^ (log7V)«g 



(5.86) 



where the sum over k was estimated by (|3.17p . This yields, with (^,i/)-high probability. 



i k 

Using (|3.3ip wc have in S 



i(^)^EEK--^)( 



' — m 



m'*-* — m) 



(5.87) 



1 ^ji^^j 



Thus we get, with (^, i')-high probability, 

i(E)-y(hi-- 



i^k 



N ^ G, 



rrS'^^ — m 



^ O 



C{\ogN) 



g Im rrisc + 7 
qNr] 



(5.88) 



Finally, we estimate the third term of (|5.83p . First, with (|3.12l) and \msc\ ^ c, we note that if A^^ ^ 1 
then 



^ CKl for i,j ^k. 



(5.89) 
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Together with (I5.79|) we get, with (^,j/)-high probabihty 



max 

fe/i 



25 ( 1 / Im rrisc + 7 
q V iVr/ 



(5.90) 



Then we use p.lOp (with (/i^^, — ^) playing the role of and G^'^ — m'*) playing the role of A^-, and 
setting a = 2, P = —2, and 7 = 1) to get, with (^, i')-high probability, 



max 



^ (log7V)4« I 1 + 



1 / Im TUsc + 7 



9 



(5.91) 



Hence we have, with (^, ;/)-high probability. 



sc: (logiV) 



4C 



/ Im nisc + 7 



(5.92) 



Note that, in applying p.l9p . we used that the family {lijf^—N is independent of the family {G[,*^— m'*-'}fc. 
Combining and (|02)) . we obtain (ISTSTj) . □ 

Proof of Lemma [5. 131 We shall apply Theorem l5.6l to the quantities 



(i) (iV) 

^ /i.fcG«/iH, 1(* ^ V) 5] /i.^G^f /i;, , 

fc/i k^l 



(5.93) 



and define S as in the beginning of Section 15.31 i.e. is defined by requiring that (|4.ip and (|5.78p - (|5.79p 
hold. Recall that the collection of random variables zf^ generates random variables Z^'^ for any U C § by 
dSU). Let 

p := {logNf-^/'^ . (5.94) 

Next, choose 



X := l + (logiV)^s/^""^^^ 



y (logiV) 



2C 



(5.95) 



(In other words, X is defined as the right-hand side of (|5.78p up to a constant.) We now derive a bound 
which implies both (|5.6p and (|5.9p , i.e. we establish the assumptions (ii) and (iii) of Theorem 15.61 To this 
end, we shall prove the stronger statement that, for z ^ §, r ^ p and any sets U C § with |§| ^ p, we have 



E(l([S],)|2f"r) {Y{CXuYY for u 
Using the assumptions of Lemma 14.11 we have in D that 



(5.96) 



q > (log7V)^«, Nri > (logTV)"^, 7 ^ (logiV) 



14? 



(5.97) 
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It is therefore easy to check that Zi and ^ satisfy the assumptions (i), (iv), and (v) of Theorem 15.61 Thus, 
the conclusion of Theorem 15.61 (|5.12p , imphes the claim (|5.82p . 

It remains to prove (|5.96p . By the definition of Z^"^ in (j5.1|) and (|5.93p . for i ^ §, we have 



V:S\UCVCS fe^i 
(iUS\U) 

ki^l V:S\UCVCS\{fc,i} 

(iUS\U) 

= E ^^^^^ E (-i)i'\"i(-i)i^iGif 

ki^l V:S\UCVcS\{fc,i} 
(iUS\U) 

where in the last equality we used definition of G^''^, Definition 15.81 Thus we may write 



(5.98) 



(5.99) 



where 



Ai := E E [Gfei] 



i(Si)\{fc,;},u\{fc,i} 



fceu;gu\{fc} 



^3 — EE '^^fc'*'* i^fc'] 



(S0\{O,u\{i} 



^4 



(Sifc) 

E E '^'fc'^" [^'t'] 

fcGU ; 

(Si 



(S»)\{fc},U\{fc} 



Now we have 

4 

E(l([S],)|zf'"r) = E(l([S]0|Ai+A2+A3 + A4r) ^ 4'-EE(l([S]0|A,r), (5.100) 

and we are going to bound E^l ([ S]i) \Aj f j for each j = 1, 2, 3, 4. Using the assumption (|4.ip . i.e. 

A ^ 7 ^ (logiV)-^ (5.101) 

(j5.78p . and (j5.79p . we get Aq + ^ G(log A^)~^, which implies the assumption (|5.7ip of Lemma [5. Ill 

Throughout the following we set u := |U| + 1. We begin by estimating the contribution of A\. Observe 
that if i 7^ fc, I G A and i ^ B, then [Gfe;]*'* is independent of the i-th row and column of H. (The same 
argument will be repeatedly used in the rest of the proof below.) Thus we have 



1([S],)[G., 



(Si)\{fc,;},u\{fc,i} 



1(S) [G^^](sO\{fc.0.u\{fc,;} 



< (G|U|X)l"l-i 
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where in the second step we used (|5.73p and Aq ^ CX on S. Thus we find, using |1LJ| ^ |§| ^ p = (log A^)^ "^/^ 
and q^^ ^ X , that 

El([S]0|Ai|'- ^ (logiV)2«maxE|/i,fer|/iHr((C|U|X)l"l-i)^ 

{\ogNf^q-^-{[C\\J\X)\''\-^Y 
s$ {Y{CXuTY . 
In order to bound the contribution of A2 we estimate, as above, 



(5.102) 



kl\ 



.{Si)\{k},V\{k} 



1(5) [Gu] 



(S»)\{fc},U\{fc} 



where in the last step we used (|5.73p and Aq ^ CX on ^. Thus we may apply the moment estimate (jA.4[) 
from Appendix |^ with 

Bu l(fceU)l(/^§U«)l([S],) [G,;]™'=>'U\W . 



This yields 



Ei([s],)|A2r ^ (Cr)2'-|^Q + (^-L(iogiv)«Ary^'^(qu|x)i"ij (Cr)^'- (x(qu|x)i 

where we used that |U| ^ (logiV)^, that the B^i defined above are independent of the randomness in the 
z-th column of H , and that 



q ViV ^ g ^ ^ ^ Y iV?? 

as follows from Im ^ y/fj and ?7 ^ 3. Thus we get 

El([S],)|A2r (Y{CXuYy . 

(Recah that m = |U| + 1.) 

Exchanging k and I in the above estimate of A2, we obtain 

El([S],)|A3r {Y{CXu)y . 
Finally, we estimate the contribution of A4. As above, we estimate 



^ X 



(5.103) 



(5.104) 



Hm [G. 



kl\ 



i(Sj),U 



< (C|U|X)I 



1+1 



1(S) [GkU 

by ()5.73p and Aq ^ CX on S. We may now apply the moment estimate ()A.4p from Appendix lAl with 

Bki l{k,l(^nu{z})l{[E],)[Gkif ■ 

This yields 

El([S],)|A2r ^ (r2(qU|X)l"l+i)\ 
where we used that the B^i arc independent of the randomness in the i-th column of H. This gives 

El([S],)|A2r {Y{CXu)y . (5.105) 
Combining (|5.102l) . (|5.103p . (|5.104p . and (|5.105p . we obtain ((06)) . This completes the proof. □ 
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6. The largest eigenvalue of A 



6.1. Eigenvalue interlacing. Wc now concentrate on the spectrum of A. We begin by proving the following 
interlacing property. Recall that Ai ^ . . . ^ Xn denote the eigenvalues of H and /ii ^ . . . ^ fiN Mmax 
the eigenvalues of A. The associated eigenvectors of H arc denoted by Ui,...,ujv, and those of A by 
vi, . . . , Vat =: Vinax- Also, we set G{z) := {A - z)^^. 

Lemma 6.1. The eigenvalues of H and A are interlaced, 

Al ^ A^l ^ A2 ^ /-t2 ^ ... ^ Mw-i ^ ^ . (6.1) 

Proof. We use the identity 

(e,Ge)-i = f+{e,Gey\ (6.2) 

which follows by taking (e , • e) in 

G{z){A~ z)G(z) ^ G{z)(H - z)G{z) + fG{z)\e){e\G{z) . 

From (|6.2p we get 



It is easy to see that the left-hand side of (|6.3p defines a function of z € R with iV — 1 singularities and N 
zeros, which is smooth and decreasing away from the singularities. Moreover, its zeros are the eigenvalues 
of A. The interlacing property now follows from the fact that z is an eigenvalue of H if and only if the 
right-hand side of ()6.3p is equal to /. □ 



6.2. The laws of /Xmax and v,„ax- In this section we establish the basic properties of /imax and Vmax- We 
make the assumption that f ^ 1 + Eq uniformly in N (see (j6.4p below) , which is necessary to guarantee that 
fiN-i and fiN are separated by a gap of order one. Note that in |26j it was for proved, in the case where H 
is a Hermitian Wigner matrix, that if / ^ 1 no such gap exists. 

The following result collects the main properties of /imax and v^ax for the rank-one perturbation A = 
H + /|e)(e| of the sparse matrix H. The most important technical result is (|6.9p . It states that, for large 
/, the eigenvector Vmax is almost parallel to the perturbation e. Consequently, e is almost orthogonal to 
the eigenvectors Vq, for a = 1, . . . , iV — 1 (Corollarv l6.7p . As it turns out, this near orthogonality is the key 
input for establishing the local semicircle law for A in Section [T] We refer to the discussion at the beginning 
of Section 17.11 for more details on the use Corollary 16.71 

Theorem 6.2. Suppose that A satisfies Definition \2.S\ and that in addition to (j2.8p we have the lower bound 

f > 1+eo (6.4) 

for some constant Eq > 0. 

Then we have with {^,i>)-high probability 

Mmax = / +i+0(l). (6.5) 
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In particular, there is a constant c, depending on Sq, such that with {^,i')-high probability we have 

Mmax ^ 2 + c. (6.6) 

Also, we have 

E/^-x = f+J + o{j-3+^g + ^)' (6-7) 



as well as, with (^,v)-high probability. 



M.ax = /+^+0(^^ + ^ + -^). (6.8) 



Note that (|6.7p and (|6.8p locate /imax JTiore precisely than ()6.5p m f/ie large- f regime. 
Moreover, the phase o/ v^ax can &e chosen so that we have with {S^,v)-high probability 

(v.ax,e)^l-^ + 0(-l + M^). (6.9) 

Finally, there is a constant Cq such that if 

/ ^ Co(logiV)2« and ^^2, (6.10) 
then we have with {£^,v)-high probability 

2C 



Mmax = lE/i,„ax + ^ E '^^J + O ( ^^^^ ) • (6-11) 



/n particular if (j6.10p /loZds we /laue (by the central limit theorem), 



2 (Mmax - E^iinax) ^ AA(0, 1) (6.12) 

m distribution, where Af{0, 1) denotes a standard normal random variable. 

Remark 6.3. In analogy to Definition [531 we define A^'^^ as tlie (TV- |T|) x (TV- |T|) minor of A obtained by 
removing all columns of A indexed by i G T; here T C {1, . . . , N}. If A satisfies Definition 12. 2[ then so does 

{N/{N - |T|))^^^A(^). Therefore ah results of this section also hold for A(^) provided |T| < 10. (Here 10 

can be any fixed number.) Throughout Sections [B] and [7] we abbreviate /imix := ^-nd Vmix 

Remark 6.4. Statistical properties of the k largest eigenvalues of a random Wigner matrix with a large rank- 
k perturbation have been studied in [3l[5] |26ll35] . Theorem 16 . 21 collects analogous results for the more singular 
case of sparse matrices. We restrict our attention to the special case where the perturbation is /|e) (e| . (Note 
that in [51[51l35j. the authors allow quite general finite-rank perturbations of Wigner matrices.) 

The rest of this section is devoted to the proof of Theorem 16.21 It is based on the following standard 
observation. Let /i be an eigenvalue of A with associated normalized eigenvector v. This means that 

{^l-H)v = /(e,v)e. 
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Suppose now that ^ is not an eigenvalue of H . Thus wo can choose v and K > such that 

V = K{^i-H)-^e, (6.13) 

1 = f{e,{fi-H)-'e). (6.14) 

Using the spectral decomposition of H we rewrite ()6.14|) as 

1 ^Ki^. (6.15) 
It is easy to see that (|6.15p has a unique solution, /^,„ax, greater than Xj^. Moreover, (|6.15p readily yields 

/^max ^ ^ f ^ Mmax ^ ^i, i.e. 

Mmax e [/ + Ai,/ + AAr]. (6.16) 

Our proof is based on the series expansions 



/^(e,(if^,„ax)'e), (6.17) 



fc>0 



max 

)^e. (6.18) 



Note that the expansions (|6.17p and (|6.18p can be interpreted as perturbative corrections around the matrix 
/|e)(e|. 

In order to control the expansions (|6.17p and (|6.18p . we shall need the following large deviation bound, 
proved in Appendix El 

Lemma 6.5. Let 1 s^k i^logN. Then 



{e,H''e) -E{e,H''e) 

with {^,h')-high probability provided that 1 ^ g ^ CN^^^. 
Proof of (|6.5p . The key observation is that 



^ (6-19) 



E(e , H'^e) - j x'' Qsc{x) dx 



^ (6.20) 



for some constant C(fc) depending on k. Indeed, a standard application of the moment method (see e.g. j29j . 
Section 1.2) shows that E(e,i/2"e) = C„ + 0„(g~2), where C„ := = j x^"- Qsc{x) dx is the n-th 

Catalan number. If k is odd, one finds by a similar moment estimate that E,{e,H''e) = Ok{q~^)- We omit 
the details. 

For the following we work on the event of (^, i')-high probability on which (j4.29p holds. We consider 
solutions fi of ()6.14p in the interval I :~ [2 + eQ/20,oo). By monotonicity of the right-hand side of (|6.15p 
in /, we know that (|6.14p has at most one solution in /. For any ko E N, using ()6.19p and (|6.20p we may 
expand ([gH]) in / (see (pTT]) ') as 



H\\Y , C(fco) , fco(logiV)«^-" 



k=0 \ k>ko 

k / ^ \ k 



N 



f ^ ^ , f Y , f 2 W Cjko) , fco(logjV)^fe" 

/.„ww+o(/E(,^j ^fi:^{jT4m) 
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where the first term comes from extending the sum over k to infinity and using that 

'x\'' f gsc{x)dx 



X] / (~) Qsc{x)dx = / ■ 



fi — x 



for fi > 2. It is easy to see that the second term is o(l) by an appropriate choice of ko{N). Thus we have 
proved that, for fi & I, the equation (|6.14p reads niscifj) = — /^^ + where r{^) — )• as — > oo 

uniformly in /z. 

Next, the function ^ i— > msc(M) is continuous and monotone increasing on (2,cxd), with range (—1,0). 
Let /2 be the unique solution of msdp.) = —f^^- (Note that here we need the assumption / > 1.) Using 
(|2.13p we find that p. ^ f + f^^ ^ 2 + Sq/IO. We therefore find that, for N large enough, the equation 
"Isc(m) = ~f~^ + ^(m) (which is equivalent to (|6.14l) on /) has a unique solution ^ £ I which satisfies 
fi = fl + o(l). Since ^ is the only solution of (|6.14p in /. we must have fi = fJ.ma.x- D 

Note that (|6.5p remains valid if e in ()2.7p is replaced with any ^^-normalized vector. It is a simple matter 
to check that (|6.20p is valid for arbitrary vectors e. Moreover, Lemma 16.51 remains correct for arbitrary e 
provided one replaces N~^^'^ on the right-hand side of (|6.19p with q^^. We omit the details as we shall not 
need this result. 

From ((6^ and (|429)) we find that, with (^, i^)-high probability, 

\\H\\ 

1-c (6.21) 



for some constant c > 0. In particular, (j6.17p and (|6.18p converge with (^, i/)-high probability. 

Proof of (|6.8p . From (|6.17p and Lemma 1431 we find /imax = ./(! + ?'(/)) with (^, i^)-high probability, where 
limy_j.oo ?'(/) = 0. Together with the simple identities 

E{e,He) = 0, E{e,H'^e) = 1, (6.22) 

(|6.17p . (|6.2ip . and Lemma l675l vield. with (^, i/)-high probability. 



By explicit computation we find that 

E{e,H^e) = 0{q-'^) . (6.24) 
Thus MM follows. □ 



Proof of (p?7)) . From (pT7|) and (|OT|) we get with (^, :/)-high probability 

/imax = f + ^{e,He) + ^{e,H^e)+o(^ + ^], (6.25) 

Mmax Mmax V / ./ 

where we used (|6.24p . Iterating (|6.25p yields with (^, ;y)-high probability 

Mmax = f + {e,He)-{e,He)y,f+{e,He)yf-{e,He){e,H^e)/f 



{e,H'e)/f-2{e,He){e,H'e)/f' + 0{— + —j , (6.26) 
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where we used Lemma 14.31 In order to complete the proof of (j6.7p . we use the rough estimate estimate 
Enl^^^ ETr^2 CNp + iV s$ iV^, by (g^. Recahing (jgl^ . we also get 

\E{e,Hef\ ^ ^, \E{e,He)^\ ^ \E{e , He){e , H^e)\ s=: 

by explicit calculation using (|2.5p . Now taking the expectation in (|6.26p using (|6.22p yields (|6.7p . □ 
Proof of (|6.9p . We compute the normalization constant If in (|6.18p from 

k,k'>Q 



l + JL + ofA+(l££^| ,6.27) 



with i/)-high probability, where we used Lemmas 16.51 and |4.3[ as well as (|6.8p and (|6.2ip . Now (|6.9p is an 
easy consequence of (|6.18p . (|6.8p . and Lemmas 16.51 and □ 

What remains is to prove (|6.1ip . 

Proof of ((6?TT|) . We assume (|6.10p . and in particular /imax 5^ ^(log A^)^^ by (|6.5p . Thus from Lemma 14.31 
we get, with (^, i')-high probability, 

\\H\\ ^ 6 



Mmax Co(log7V)2?' 

From ()6.17p and (|6.8p we therefore get, with (^, ;^)-high probability, 

cologJV , . 



Mm ax 



Mmax 



where cq ^ 1 is a positive constant to be chosen later. Thus we find with (^,z/)-high probabihty 

fc=0 ^max A'max /^max 

Therefore we get, for any < cq ^ 1 and using (|6.8p and Lemma 16.51 that with (^, ;^)-high probability we 
have 

fe=0 

Here the constant in O(-) depends on cq. 
Next, Lemma [4.31 vields 



/^max Mmax V fvN 



C(e,ij'''e)| ==; (5/2)'^ + Ar^'^e-'^('°s^)' 3\ (6.28) 
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for k ^ (;^/C)(log A^)^^^. (Here we used Schwarz's inequality and the trivial estimate E(e,i/e) ^ N'-' to 
estimate the contribution of the low-probability event on which (|4.29p does not hold.) By the assumption 
(|6.10p on ^, (|6.28|) holds for k ^ cq log N for co small enough. It is therefore easy to see that the equation 

k=0 ^ 

has a unique solution /2 > 0, which satisfies p. = f + 0{f^^). Writing /imax = /l + Ci '^c get with i^)-high 
probability 



Next, by (|6.8p we find C = 0(/ ^) with , zy)-high probability. Moreover, (|6.22p and Lemma [6.51 implv 
that (e,iJe) = 0((log A^)« A^^^/^) with i/)-high probability, and that the sum in (|09| starts at fc = 2. 
This yields the expression with (^, ;y)-high probability 

. 

C = (e 



for some coefficients a/ = 0(1), by (|6.28p . We conclude that with (^, z^)-high probability we have 

,.,,.,,.0,r',).0(<l^).(e,.e,.0(«^), 

where we used that |(e,i7e)| ^ {\ogN)^N~^^'^ with (^, ;/)-high probability. 
Summarizing, we have proved that with (^, ;y)-high probability we have 

A*max = A* + ^ X! + ^ ' (6.30) 

id 

where \R\ ^ o(^ii^£^y Using E/x^^^ ^ iV^ we therefore get E|i?| ^ o(^ii^^), and (IHTD) follows by 
taking the expectation in (|6.30p . □ 



This concludes the proof of Theorem [621 

For future reference, we record two simple corollaries which we shall use in Section[7]to control the matrix 
elements of G. 

Corollary 6.6. Suppose that A satisfies Definition \2.2i Then we have with {S,,v)-high probability 

l^al ^ max|A;3| = \\H\\ 2 + (log N)^q-^^^ for a=l,...,N -1. (6.31) 

Proof. Use (p?T|) and Lemma [321 □ 

Corollary 6.7. Suppose that A satisfies DeRnition \2.2\ and that, in addition, f CqN^^^. Then we have 
with {^,v)-high probability 

^|(v„,e)p = 0{f-'). (6.32) 

Proof. The statement is trivial unless / ^ 1 + eo, in which case we use (|6.9p and ||ej| = 1. □ 



53 



7. Control of G\ Proofs of Theorems 12.91 and 12.161 

In this section we adopt the convention that if F = F{H) is any function of H then F{A) is denoted by F, 
i.e. we use the tilde (•) to indicate quantities defined in terms of A = H + /|e)(e|. Thus, for example, we 
have 

A = H, Ha = >'a, Va = u„, G{z) := {A-zy^, fh[z) := j^^Guiz) , 

and 

Ao := maxIGij I, A^ := max\Gii - msc\ , A := \m - nisd , ■= G^i - ttIsc ■ (7.1) 

Note that G and m were already introduced in (j2.19p . 

We begin by using the interlacing property (j6.1|) to derive a bound on A. Recall the convention that if 
F = F{H) is any function of H then F is defined as F{A)^ where, we recall, A = H + /|e)(e|. 

Lemma 7.1. Let A satisfy Definition \2.2l Then for any z Cz Dl we have 

Proof. Define the empirical density g{x) :— jj 5{x~ fia)- Thus the integrated empirical density defined 

in (|2.23p can be written as n(£') ~ J^ao ^(^) Similarly, define the quantities g and n in terms of the 
eigenvalues Ai, . . . , Aat of H. Using integration by parts we find 

A(.)-A(z) = /MzL^dx = - f ^i-)-^[-) 
J x-z J [x~zY 

By (|6.ip we have |n(x) — n(.T)| ^ for all x. Thus we find 

We note that the claim p.20p of Theorem 12.91 is now an immediate consequence of Lemma 17.11 and the 
strong local semicircle law (j2.16p for Fl. 

The rest of this section is devoted to the proof of the estimate p.22p for the matrix elements of G. From 
now on we consistently assume the upper bound (j2.2ip on /. 

7.1. Basic estimates on the good events. In this section we control the individual matrix elements Gij 
in terms of A. which in turn will be estimated using Lemma 17.11 Our basic strategy is similar to that of 
Section [3l but, owing to the no n- vanishing expectation of Oy, the self-consistent equation for Ga has several 
additional error terms as compared to Lemma 13.101 see Lemma 17.21 and Proposition 17.61 below. The most 
dangerous of these error terms is estimated in Lemma 17.51 below. We will use the spectral decomposition of 
combined with bounds on (e,Vfj) and j|va|joo- The former quantities are estimated using Corollarv 16.71 
while the latter are estimated by bootstrapping. The spectral decomposition requires simultaneous control 
of all eigenvectors, whose associated eigenvalues are distributed throughout the spectrum. Since bounds 
on ||vq||oo (delocalization bounds) may be derived from a-priori bounds on Afi(z) for Rez being near the 



54 



corresponding eigenvalue, we will therefore need bounds on Arf(z) that arc uniform for all z S I?l with a 
fixed imaginary part. Hence the bootstrapping now occurs simultaneously for all G [— S, S] (see Definition 
O below). 

We use the following self-consistent equation for G, whose proof is an elementary calculation using p.l2p 
and p.l3p applied to G; see also Lemma [3. 101 



Lemma 7.2. We have the identity 



Gii = (7.2) 

-z - nisc - i[v\ - T.j) 



where 
and 

(i) ~ ~ 

:= m.^a,,G^au. M ^ - ^^(e, G«e) + ^ E ^ ■ (7-3) 

k,l j 

Recall that in expressions such as (|7.3p the vector e stands for e^-i; see (|2.2p . 

Definition 7.3. For N^^{logN)'" ^ 77 ^ 3 introduce the set D{t]) := {z G Dl : Imz = 77}. We define the 
event 

n{Tj) ( sup (Ad{z) + A,{z)) (logiV)-4- (7-4) 

Recall the definition of A^^^ from Remark 16.31 Similarly to Lemma [3. 121 we have the following result for 
the matrix A. 

Lemma 7.4. Fixing z = E + irj ^ Dl, we have for any i and T C {1, . . . , iV} satisfying i ^ T and |T| ^ 10 
that 

(7.5) 



m(")(z) = ,?^m(z) + 0(^J-), 



holds in ^(jj). 



The following lemma is crucial in dealing with error terms arising from the non- vanishing expectation of 
j. Recall that, when indexing the eigenvalues and eigenvectors of A^'^\ we defined amax ■= N — |T|. 



Lemma 7.5. Fixing z ^ E + irj ^ D^, we have for any T C {1, . . . , N} satisfying |T| ^ 10 and for any i e T 
that 

(T) , 



N 

k,l 



on Q{r]) with {^,v)-high probability. 



Proof. For technical reasons, it is convenient to avoid the situation where //max is close to S. In order to 
ensure this, we may if necessary increase S slightly and hence assume that / ^ S — 3 or / ^ S + 3. We start 
by proving the following delocalization bound. Define 

R := max max maxIwi^Vi)! , Rmax '■= max max|wfJ25r(7)l • (7.7) 
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First we claim that on fl^rj) we have with z^)-high probability 

R < Cy^ 

and, assuming / ^ S — 3, we have with j/)-high probability 

In order to prove (|7.8p and (|7.9p . we note that on ^1{t]) we have, in analogy to p.29p . 

c |g'P(z)| < c 



(7.8) 
(7.9) 

(7.10) 



for all z G Dl such that Imz = rj and large enough. From (|6.3ip we find that z := yUa + i?? G Dl with 
(^, i/)-high probability for a ^ amax; see Remark 16. 31 Thus we get with (^, i')-high probability 



C ^ ImG5?(^m+ir,) = Y: 



This concludes the proof of (|7.8p . Next, if / ^ S — 3 then by (|6.5p and Lemma l4?3l we have /imax G [— S, S] 
with (^, i')-high probability. Thus we get (|7.9p just like above. 

Having established (|7.8p and (|7.9p . we may now estimate the left-hand side of (|7.6p . using the spectral 
decomposition of G^'^^ by 



/ 



I (T) \ (T) 
^e,Vinax; (T) .n, 

(T) / y ■^maxVv"-'! 



/ 



V^^- |T| 



/ (T)x (T) 



(7.11) 



By the dclocalization bound (j7.8p and the large deviation estimate p.l9p . we find for a ^ a,„ax on fi(ry) 
with (^, i')-high probability 



Similarly, we have 



(T) 



(T) 



< (logiV)q - + 



i? 1 



^ (log^)^ 



Next, we estimate the first term of (|7.1ip . If / ^ E — 3 then / ^ G, and the first term of (|7.1ip is bounded, 
with (^, i')-high probability, by 



G 



1 



(iogiv)«( ^ + — ) ^^(iogiv)«( - + ^ 



1 1 



If / ^5 S + 3 then by (|6.5p and (|6.8p we get |/Xmax — zj ^ c/ with (^, j/)-high probability. Thus the first term 
of (|7.11l) is bounded, with (^, i/)-high probability, by 



G/ (logTV)? ^1 

.9 \/iV 



G(log7V)« 
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where we used the trivial bound i?i„ax =^ 1- We therefore get that the left-hand side of (|7.6p is bounded with 
z^)-high probability by 



C(log AT)? ( 1 + ^ ) + C(log 7V)« 



X] I (T) 



1/2 



By (|6.32p this becomes 

C(logiV)« 
Using (|7.5p we get 



1 1 



+ C(log7V)^ 



1 / R 1 



1/2 



X I (T) 



(T)_ ,2 



- Im = — Im Tr G*-^' = — Im i 



O 



which therefore yields the bound 



Im m H — 
'1 . 



on $7(77) with i')-high probability. Here we used (|7.8p . The claim follows. 
For the following statements it is convenient to abbreviate 



'lmm(z) 1 
Nr] ^ Nt] 



Proposition 7.6. Assume (j2.2ip . T/ien /or z = i? + i?; e we /laue 

A„(z) C$(z), 
max|Z,(z)| C$(z) , 

i 

C 



max|^i(z)| ^ 



Nr] ' 



in ^(rj) with {^,i>)-high probability. 



(7.12) 



□ 



(7.13) 



(7.14) 
(7.15) 

(7.16) 



Proof. We start with (|7.14p . Let i ^ j. Using p.lSp for A instead of H, and writing = f/N + h^j, we 
get with (^, ;y)-high probability 



C-'\G,,\ ^ - 



(ij) 

k,l 



(ij) 



k,l 



k,l 



EJ_Q{i3)J_ 
N N 

k,l 



(7.17) 
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by Lemma l3Jl and (fTTOl . 

The second term of (|7.17p is bounded exactly as in p.34p and p.35p : using p.22p and (|7.10p we estimate 
it by 



on $1(77) with i')-high probabihty. 

The last term of (j7.17p is bounded with ;^)-high probability by 

^ £1 + ^ + 11 y |(e,vfe))|' ^ ^ + (7.18) 
^ N Nri Nn ^ I \ ' a / 1 TVji ' ^ ^ 

where in the first step we used (|6.8p . and in the last step (|6.32p . Here we estimated the term arising from 
M^nil by C(iV77)-i if / ^ 2S, and by C//7V if / > 2E. 

Using Lemma [7751 the third and fourth terms in (|7.17l) are bounded on ri(?y) with i/)-high probability 
by the right-hand side of (|7.6I) . This concludes the proof of (|7.14l) . 

Next, we prove (|7.15l) . By definition, 

= ^/,,,G«i-+^^G«/,,, + IE,^/i,,G«/,,,. (7.19) 

The first two terms are bounded using Lemma [7.51 and the last one exactly as p.33p . 

Finally, we prove Using ((O)) . (lO^ . and (TTTU)) . we find on f2(7y), with (^, i^)-high proba- 

bility, 

~ / /^ jV-l|(e,v^L)p pN-1 ^ \(eM^ l^G^ 



f f' N -1 1 
N N N f 



1 + 



(7.20) 

where in the second step we distinguished the two cases / ^ 2E and / ^ 2E, as in (j7.18p . □ 

We may now estimate in terms of A. 
Lemma 7.7. Assume (|2.2ip . For z = E + irj e we have 

max G,j(z) - m(z) ^ C^{z) (7.21) 

i 

on Q{ri) with {^,iy)-high probability. In particular, on ^2(77) we have, with {^,iy)-high probability, 

Ad{z) ^ A{z) + C<i>{z) . (7.22) 
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Proof. Using (TTTS)) . ((7?T6| . and Lemma l3Jl we find 



max|T,| (logiV)«- + C(log7V)2«( J5^ + -L ) (7.23) 



q \\l Nr] Nrj 

on Vl{rj) witli i')-higli probability. From (|7.2p we therefore get 



|G,.||G,,||T,-T,| ^ (logAr)«^ + G(logArr«U^^ + ^j (7.24) 



on rJ(ry) with (^, ;/)-high probabihty. Since m = ^ Gjj, the proof is complete. □ 



7.2. Establishing ^{rj) with high probability. What remains to complete the proof of Theorem 12.91 is to 
prove that the events fi(f?) hold with (^, i')-high probability. We do this using using a simplified version of 
the continuity argument of Sections 13.41 and [ 



Lemma 7.8. J/?; ^ 2 then ^{rf) holds with (^,iy)-high probability. 

Proof. The proof is similar that of Lemma [3. 161 we merely sketch the modifications. 

Let z = E + if] E Dl for rj ^ 2. We estimate Ao(z) following closely the proof of (|7.14p . using (|7.6p and 
setting i? = 1 in (|7.12[) . Using the rough bound |Gy| + |77i| ^ 1 as in p.43|) . we find 

A„ < (log7V)«^ + (logiV)2«-^ + ^ ^ G(logiV)-2€ 
with (^, i')-high probability. Similarly, we find 

|Z,| ^ (logiV)«- + (log7V)2«-^ ^ (logiV)-2« 
Q V JV 



with (^, ;/)-high probability. In order to estimate Ai, we proceed similarly to p.44p and find 

1-^'^' ^ ^ + 7^|(e'G«e)| + l5]|Gg||G,,|(K| + |4|). 

j 

The term Zij is estimated exactly as Aq above; using the calculation of (|7.18p we therefore get 

~ G G G f 

\A\ ^ (log7V)«^ + (log7V)2«-^ + ^ < G(logiV)-2« 
9 V 



with (^, i/)-high probability. 

Now we may follow the proof of Lemma [3. 161 to the letter, starting from (|3.45p to get A^ ^ G(logiV)^^? 
with (^, ;/)-high probability. 

Thus we have proved that Ad{z) + Ao{z) ^ G(logAf)^^^ with (^, zy)-high probability. A simple lattice 
argument along the lines of CoroUarv 13.191 then concludes the proof. □ 
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The following simple continuity argument establishes ^{rj) with ;y)-high probability for smaller rj. Let 
r]k be a sequence as in Section [3751 

Note that, unlike in Section 13.61 each step k ^ fc + 1 of the continuity argument has to establish a 
statement for all z S D(r]k^i). 

Lemma 7.9. We have 

Proof. We proceed by induction on fc. The case fc = 1 was proved in Lemma [7751 We write 
Now for any w G -D(?/fc) and on Q.{r]k) we have, using (fm| . ([77^ . and (P7^ . 

AdH + AoH s; c(iog7V)-2« 

with i')-high probability. Using the estimate (|3.56p we find, for any z G D(rik+i), 

Ad{z) + lo{z) C(logiV)-2« 
with (^, i')-high probability. Using a lattice argument similar to Corollary 13.191 we therefore find 

The claim follows. □ 

This estimate ([2722]) now follows from (|7.14p . (|7.22p . p.20p . and the lattice argument of Corollary 13. 191 
This concludes the proof of Theorem 12.91 



7.3. Eigenvector delocalization: proof of Theorem 12.161 We may now prove Theorem 12.161 Dclocalization 
for the eigenvectors Vi, . . . , "Vn-i is an immediate consequence of the weak local semicircle law. From (|6.ip 
and Lemma 14.31 we find that fii, . . . , fi^^i € [— S, E] with (^, z^)-high probability. Let L = 8^ and set 
77 := (logiV)^. Using p.Sp . Lemma [7.1) and (|7.22p . wc therefore find with (^, ;y)-high probability 

C ^ ImG,,(,.„ + u,) = X. (^,-^,)2+,^2 ^ (« < ^) ■ (7-25) 

This concludes the proof of (|2.32p . Moreover, the same argument (with a = A'' in (|7.25[) ) proves (|2.34p if 
/ ^ E — 3, since in that case /imax G D with (^,z/)-high probability by (|6.5p and Lemma 2731 
Next, we note that (|2.33p is an immediate consequence of (|6.9p . 

In order to prove (|2.35p . we use the following large deviation estimate which is proved in Appendix [XI 
Lemma 7.10. For k ^ logiV and fixed i we have with (^, v)-high probability 

\{H^B){i)\ = /^"i^M..---^..-!.. < (logA^)'=«. (7.26) 
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Now from the expansion (|6.18[) we get, with ;y)-high probability, 





) 



and (|2.35|) follows since K = l + 0(/^^) (see ()6.27p ). In this argument we used that / ~ /imax ^ C'o(logA^)^ 
for some large enough Cq to overcome the logarithmic factors in (|6.18p that arise from (|7.26p . This concludes 
the proof of Theorem l2.16l 

Finally, we outline the proof of (|2.36p for 1 <C / ^ C(log A^)^. The idea is to use the same proof as for 
(|2.34p . relying on the estimate (|7.25p . In order to do this, we need the pointwise bound C ^ Gii{piN + ^v) 
which we get by extending the proof of Theorem l2.9l to a larger set D^. Here has to contain /i^r, so that 
we have to choose E = C(log A^)^ in the definition p.ip of Dl with some large constant C. 

This extension requires some modifications in our proof of the local semicircle law. Now instead of the 
bounds ()3.8p . we have c(log7V)~f ^ |msc(-z)| < 1 for z G Dl- We modiiy the definitions p.25p of il(z) 
and (|7.4p of ^{rj) by replacing (logiV)~? with (logiV)"^^. Then, on these events, we get the lower bound 
\Gii{z)\ > c(log7V)~« instead of (jX^ . One can then check that all estimates of Sections [3] ~ [7] remain 
valid with some deterioration in the form of larger powers of (log A^)^, provided that L ^ for some large 
enough C; we omit the details. 

7.4. Control of the average of 'J2k^i ^^il^hi- In this final section, we estimate the averaged quantity 



The estimate of 11 is not needed for the local semicircle law, but we give its proof here as it is a natural ap- 
plication of the abstract decoupling lemma, Theorem l5.6l The expression ()7.27p arises as an error term when 
controlling resolvent expansions of the non-centred matrix A. Such expansions are used in the companion 
paper [T^ to establish the universality of the extreme eigenvalues; see Section 6.3 in [T3j . 

Note that a naive application of the large deviation bound (|3.19p yields |n| ^ {\ogN)'-^^q~^ with (^, v)- 
high probability. In order to establish universality of the extreme eigenvalues in [13] . it is crucial that the 
factor q"^ be improved to . This is the content of the following proposition. 

Proposition 7.11. Suppose that the assumptions of Theorem \2.9\ hold. Then for any z G £'120(^+2) we have 
with {^,iy)-high probability 




(7.27) 




(7.28) 



Proof. We shall apply Theorem 15.61 to the quantities 





(U») 



(7.29) 



k 



k 
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and the event 



n 



max \Gij{z) - 5ijmsc{z)\ ^ X{z) 



Recall that the collection of random variables (2]"^')^ generates random var iables Zf'" through 1^. We 
choose p := (log A'')^ in Theorem 15.61 It is immediate that the assumptions (i) and (iv) of Theorem 15.61 are 
satisfied. By Theorem 12. 9[ the assumption (v) of Theorem 15.61 holds as well. 
We shall prove that, for any U C S with 1, i ^ §. |§| ^ p, and r ^ p we have 



E 1 [s],, z; 



S.V\r 



(y(CMX)")^ 



(7.30) 



where u := |U| + 1. Supposing (|7.30p is proved, both assumptions (ii) and (iii) of Theorem 15.61 are satisfied. 
Then the claim of Theorem [E^ ((5l^ . and Markov's inequality yield ([7^ . 

It remains to prove (|7.30|) . Throughout the following wc abbreviate u :— |1U| + 1. By the definition of 
Zf in ^ and (17:^ we find, for 1, i ^ §, 



i(z^i)(-i)i^\"i (-i)"''EGir^^.» 

V:S\UCVCS k 

im\v) 

k V:S\UCVCS\{fc} 
((»S)\U) 

l(z^l) Y /^hG™'=^'"^^'=\ 



where in the last step we again used (|5.ip . as well as Definition 15.81 and the fact that {{Si) \ {k}) \ (1U\ {fc}) 
{Si) \ U. Wc spfit 



where 



D, := lii^l)J2hk^^Gf 
Thus wc may estimate 



(S»)\{A;},U\{fc} 
fc 



(Si) 



D, := l{z^l)J2hk:Gf: 



(Si),U 



feeu 



E(l([S],)|zf"r) ^ 2^^E(l([S].)|i?, 



To that end, we shall make use of (|5.73p . Note that Lemma [5.111 is entirely deterministic. In particular, it 
applies if all quantities are defined in terms of A rather than H (and hence bear a tilde in our convention). 
We shall apply it to the Green function G. 

We start by estimating Di. Since G'f^*)^^'^'^''^^^''''^ in Di is by definition independent of the i-th column 
of H, for |U| |§| < p = (logiV)^ we get from ((575|) that 



(a)\{fc},u\{fc} 



1(S) Gf^»)\{'^-}>u\{fc} 



< (G|U|X)I 
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Here we used that Aq ^ X on ^. Now wc may estimate, using |U| ^ (logiV)^, 



E(l([S],)|i?ir) ^ (logiVr«nmxE|/i,,r(C|U|X) 



s$ (c {log N)'^q~\cmxy' 

(y(CuX)")\ 

Next, we estimate D2. As above, since G^^''*''^ in D2 is by definition independent of tlie i-th coiumn of 
H, for |1U| s$ (logiV)« we get from ([Ffg]) that 



^ (C|U|X)l"l+i 



Now we use the moment estimate (jA.2p with a ~ 1, (3 ~ —2, j ~ 1, and 

A, := l(fc^§U«)l([S]OG(f'". 

This yields 



E 



(l([S],)|i?2r) ^ (r(C|U|X)l"l+i) 



Here we used that Ak defined above is independent of the randomness in the i-th column of H. Thus we 
conclude that 



E(i([s],)ii?2r) (Yicuxyr- 

This completes the proof of (|7.30p . and hence of (|7.28p . 



□ 



8. Density of states and eigenvalue locations 



8.1. Local density of states. The following estimate is the key tool for controlling the local density of states 
- and hence proving Theorems 12.101 and 12. 121 

Lemma 8.1. Recall the definition ()2.14p of the distance ke from E to the spectral edge. Suppose that the 
event 

j - '".Ml < (log iV)« (-i" j;;!^ . ^} + 1^) ) (8.1) 

holds with v)-high probability for L CoC; where Co is a positive constant. For given Ei < E2 in S, S] 
we abbreviate 

K := min{KBi,K£;J , £ := maxjE'z - £■!, (log7V)^iV"^} . (8.2) 
Then, for any — E ^ i^i < E2 ^ we have 

(H(S2) -H(Si)) - {nsc{E2) - nsc{Ei)) 

with (^,iy)-high probability. 



^ (logiV)^« 



£ 



(8.3) 
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Proof. Recall the definitions (|2.1ip and (j2.12p . Similarly, we have 
Thus we may write 

= ^TrG(z) - J-^- 

We introduce the differences 

:= g-Qsc, ■= fh-msc- 

Following [TJ, we use the Helffer-Sjostrand functional calculus. Set rj := (logiV)^iV^^. (Recall that 
L = Co^.) Let X be a smooth cutoff function equal to 1 on [—£,£] and vanishing on [—2£,2£]'^, such that 
lx'(2/)l ^ C£~^. Set T] := iV~^ and let / be a characteristic function of the interval [Ei,E2] smoothed on 
the scale rj: f{x) = 1 on [Ei,E2\, f{x) = on [Ei - t], E2 + r]]" , \ f'{x)\ < Ct?"!, and ^ Crj-^. Note 

that the supports of /' and /" have measure 0{rj). 

Then we have the estimate (see Equation (B.13) in |14) ) 



/(A) Q^{\)A\ 



^ c 









dyif{x)+yf'{x))x'{y)m^{x + iy 



C 



da: / dy/ {x)x{y) y\-a\m (.t + iy) 



C 



da; 



dy f"{x)x{y) y Im m^(x + iy) 



(8.4) 



Since x' vanishes away from [£, 2£], the first term on the right-hand side of (|8.4p is bounded, with z^)-high 
probability, by 



C 
£ 



dx 



2£ 



dy\f{x)+y.nx)\{\ogN) 



1 



1 



^V^T£ ' N£) 



£ 



where we abbreviated k := nim{KEi , k^;^}. In order to estimate the two remaining terms of (|8.4|) . we estimate 
Imm^(a; + iy). If y ^ ^ we may use (|8.ip . Consider therefore the case < y ^rj. From Lemma [321 we find 



\lmmscix + iy)\ ^ Cy/nx + y . 



(8.5) 



By spectral decomposition of A, it is easy to see that the function y ylmm(x + iy) is monotone increasing. 
Thus we get, using (jS.Sp . x + irj G D^, and (|8.ip . that 

_ _ _ I I \ (logTV)'-^^ 

ylmm{x + iy) ^ rjlmm{x + irj) < (logiV)'^^?] \/ Hx + V -\ ^ tt^ ^ 77 iu^v) (8.6) 

\ q I\7] J 



N 



with (^, i')-high probability. From (|8.5p . to = to — jtIsc, and the definition of 77, we find 



|j/ImTO'^(a; + i?/)| ^ 



(logiV) 

AT 



(y < v) 



with (^, ;/)-high probability. Since rj ^ r/ we therefore find that the second term of 
(^, i/)-high probability, by 



(8.7) 

is bounded, with 



C(logiV)^« 



N 



dx\r{x)\ fdyx{y) < ii^i^ 
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In order to estimate the third term on the right-hand side of (|8.4p . we integrate by parts, first in x and 
then in y, to obtain the bound 



C 



dx /' [x) T] Re m'^ (x + irj) 



C 



Ax \ dy f{x)x{y)yRcm'^{x + iy) 



C 



dx I dy f' {x)x{y)^cm^{x + \y) 



(8.8) 



The second term of (|8.8|) is similar to the first term on the right-hand side of (|8.4p . and is easily seen to be 
bounded by 



In order to bound the first and third terms of (|8.8p . we estimate, for any y ^r], 



\m'^{x + iy)\ |m'^(a; + ir/)| + / dw ( |9Km(a; + iw)| -f |9„m5c(2; + iit)| j 

Jy 

Moreover, using (|8.6p and p.27p . we find for any u ^rj that 



9„ 771(2; + iu) 



N 



TrG^{x + iu) 



^ — y^lGi, (x -|- iu)|^ = — Im7n(a; + iu) ^ —^rjlmmix + irf) 
N ' u 



with (^, i')-high probability. Similarly, we find from (|2.12p that 



\dumsc{x + iu)\ ^ — rjlmniscix + ir]) . 



Thus dm]) yields 



\m^{x + iy)\ ^ (logA^)^«(l+ j' du\\ s$ (logAr)'^«- 

\ Jy ^ / y 



{v ^ v) 



(8.9) 



with (^, i')-higli probability. 

Using ([8^1) . we may now bound the first term of ((8?8)) by (logiV)'^^7V~^ 

What remains is the third term of (|8.8p . We first split the y- integration domain [77,00)] into the pieces 
[ry,7y] and [77,00). Using (|8.9p . we estimate the integral over the first piece, with (^, i/)-high probability, by 

d:r|/'(x)|^''d2/|7n^(a; + iy)| O^i^ . 

Using (|8.ip . we may therefore estimate the third term of (|8.8p . with (^, i^)-high probability, by 



(logiV) 
TV 



+ {\0gNfiJ dx P dy\f{x)\l^-^ + 



1 1 



VK^+yq- 



(i^ + (log7V)-^/d.|/'(.)| 



2£ 



1 1 



28 



1 



Ny Jfj V^ + y 



< (log TV) 



1 



N q^^T^ 
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Summarizing, we have proved that 



/(A) g^iX)dX 



^ (log AT) 



with ;/)-high probabihty. 

In order to estimate \n{E) — nsc{E)\, we observe that (|8.6p imphes 

|n(x' + 77) — n(x — ^ Cr/lm.rh(x + ir/) ^ 
with i')-high probabihty. Thus we get 



(logiV) 
N 



i=l,2 



n{E,)^niE2)- J f{X)g{X)dX 

with i')-high probabihty. Similarly, since g^c has a bounded density, we find 

n.ciEi) - nsciE2) - J /(A) gs^X) dX 

Together with ([8ll3| . we therefore get (|83)) . 

We draw two simple consequences from Lemma l8. II 



C(logjV)'^^ 
TV 



' N 



Af{E^,E2) - Afsc{Ei,E2) 



1 + 



(iogiv)c« , iv £;2-£;i 



(8.10) 



□ 



Proposition 8.2 (Uniform local density of states). Suppose that A satisfies Definition \2.2\ and that 
^ and q satisfy p.lSp . Then, for any Ei and E2 satisfying E2 ^ Ei + (log N)^^N~^ we have 



.11) 



^Afsc{Ei,E2)\ ^K + E2- El 

with {S,,v)-high probability, where we abbreviated k :~ min{K£;j, k^;^}. 

Proof. By (g^Ol), the estimate (|5T^ holds. Assuming -Y. ^ Ei E2 S, we get from ([O)) . with 
(^, z^)-high probability, 



\Af{Ei,E2)-AfsciEuE2)\ ^ (logAr)^«(l 



N E2- El 

y/K + E2- EiJ ' 

from which the claim follows. If Ei < — E and E2 ^ S, the claim follows by replacing Ei with — S and using 
Lemma [4.41 The other cases where — E ^ £^1 ^ £^2 ^ S does not hold are treated similarly using Lemma 
01 □ 

The proof of Theorem 12.101 is completed by observing that both its statements, (|2.25p and p.24p . are 
special cases of ()8.1ip . (Recall that in the bulk we have Nsc{Ei, E2) ~ N{E2 ~ Ei); at the spectral edge we 
have JVsc{Ei,E2) ^ N{E2 - Sl)^/^ which is sharp for Ei = -2.) 
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Proof of Theorem 12.121 Let us assume that ^ 0; the ease i? > is treated similarly. Setting 

El := -2- (logiV)'^i«(q-2 +iV-2/3) 

for some eonstant Ci > 0, we find that jisc{Ei) ~ and n{Ei) = with i/)-high probability for Ci large 
enough, by Lemma [4.41 We may assume that E ^ Ei. 

By (|2.20p . the estimate (|8.ip holds. Therefore setting E2 = E in Lemma [ST] yields 

\n{E)-nUE)\ ^ (logiV)^« f 1 + 1^Je-Ei + {log N)C^N-^) ^ (logNf^^^ . ^ . 



with i^)-high probability. This holds for any fixed E. The claim (|2.27p . which is uniform in E, now follows 
by a lattice argument similar to Corollary I3.19[ whereby we choose a lattice of points G [— S, E] with 
\E,+i - Ei\ < N-^; we omit the details. □ 



8.2. Eigenvalue locations. The following result contains the main estimate on the locations of the eigenvalues 
of A. Recall the definition (|2.28p of the classical location 7q, of the a-th eigenvalue. 

Proposition 8.3. Suppose that A satisfies Definition \2.2\ and that ^ satisfies (|2.15p . Let (p be an exponent 
satisfying < (j) ^ 1/2 and assume that q = N'^ . Then the following statements hold with {^^v)-high 
probability for all a ^ 1, . . . , N — I, for some sufficiently large constant K . 

(i) Ifmax{K^^,K^J ^ (log7V)^«(7V-2/3 _,_7v-20) then 

lMa-7al ^ (logiV)C?(iV-2/3 + Ar-20). (8.12) 



(a) Ifma.x{K^^,K^J > (logiV)^«(iV-2/3 + 7v^20) ^^^^ 

|/^a-7a| ^ (log7V)^«(7V-2/3S-l/3+ ^2/3-40^-2/3 +^r-20) (8.13) 

where we abbreviated a := minjo;, N — a}. 

Proof. To simplify the presentation, we concentrate only on the eigenvalues ^i, . . . , fJ.N/2- The remaining 
eigenvalues • ■ • , fJ-N-i are dealt with similarly, using Lemma 14.41 and the estimate fj,N ^ 2 + c which 

holds with (^, z^)-high probability. 

We define the event O as the intersection of the events on which (|4.30p holds and on which 

\n{E)-n,,iE)\ ^ (log 7V)^»« (^1 + 1 + ^) (8.14) 

holds for all e S, S] and some positive constant Cq. Recalling (|2.27p and (|4.30p . we find that ft holds 
with (^, i/)-high probability for large enough Cq. Note that on f2 we have fiN/2 ^ 1- Indeed, the condition 
fJ'N/2 ^ 1 is equivalent to n(l) ^ 1/2, which follows from (j8.14p and the fact that nsc{l) > 1/2. 
Abbreviate ( := min{2(/), 2/3} and let Ci > Cq. We use the dyadic decomposition 

2 1ogAf 

{l,...,iV/2} = U C/fc, 

fc=0 
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where we defined 

Ua := ^ N/2 : 2 + maxima, fJ-a 

Uk |a s; N/2 : 2''{\ogN)^^^N~'^ < 2 + max{7„, /x^} ^ 2''+^ (log Nf^^N''^^ 

By definition of C/q and Lemma on we have 

|Ma-7a| ^ (log7V)^«7V"« (aeC/o). 

This proves ([51^ . 

Next, let k ^ 1. From (|8.14p we find that on we have 



for A: ^ 1 



TV 



.(7a) = n(/x„) = n,,(/x„) + (logiV)^««0 — + — 



1 1 



(8.15) 



On ri and for a G Uk, the second term on the right-hand side of (|8.15p may be estimated as 



(logiV)^««of - + 4 + ^ (log7V)^««(iV-i +TV-30) ^C'2('^+i)/2(log7V)(^°+c^i/2)«TV-';/2-2^, 

VTV q-^ q^ J 

since ^ 2 + /.Jq,. Moreover, fl and for a £ Uk we have 

n.e(7a) + n..(M.) > c23^/2(log iV)(3/2)^^^iV-=^^/^ 

where we used the simple estimate n,,c(-2 + - for s$ a; 3. Thus we have, on and for a € Uk, 

(logTV)C^o«o(^i- + l + ^^^ < n«,(7„) + n,,(/x„), 

from which we deduce using (|8.15p that 

n«c(/ia) = n,,c(7a)(l + o[(logTV)-(^^-^''^«]) . 

Thus we find, on and for a £ Uk that 2 + 7^ ~ 2 + and hence n'^^{x) ^ n-gcila) for any x between 7q, 
and iia- Here we used that n'g^{x) ~ (rise (a;) )^^^ ~ \/2 + a; for — 2 ^ a; ^ 1. Thus the mean value theorem 
and (|8.15p imply, on and for a g Uk, 



\Ha-"fa\ ^ 



C\nsci^J,a) - Wse(7a)| 
<c(7a) 

C(logTV)Co€ 
(a/TV)i/3 

C(logTV)'^"? 



. (^7V-2/3 + 7V1/3-30 + ^-20^1/3 ^ 7Vl/3-20^|^^_^^|^ ^ 

where we used that ^ K-y^ + — 7q| and ^ {a/N)"^^^. Thus we find, on and for a G [/fc, 

|/Xa-7a| =^ (log7V)^«(^TV-2/3c,-l/3+iv2/3-40c,-2/3^^V 



This proves (pTS]) . 



□ 
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Proof of Theorem 12.131 We apply Proposition 18.31 As before, we only deal with the eigenvalues a ^ 
the proof for the eigenvalues N jl < a ^ — 1 is the same. Suppose that a satisfies Case (i) of 
Proposition [531 Using ajN = rise (7a) ~ (2 + 7q)^/^ wc find that 

a ^ (logA)^^(l + (8.16) 
Therefore we get, squaring (|8.12p and (|8.13p and summing over a, 

Af-l 

^ iMa - 7a P ^ (log Nf^ (1 + (iV-4/3 + N'^'^') + (log Nf^ (n^^ + iV4/3-8<> + 

a=\ 

with (^, ;/)-high probability. This concludes the proof of p.29|) . 

Finally, we note that (|2.30p follows from (|8.12p and (|8.13p as well as the above observation that Case (i) 
in Proposition 18.31 implies (|8.16p . □ 



A. Moment estimates: proofs of Lemmas I3.8L I4.3L 16.51 and 17.101 



In order to prove Lemma 13. 8[ we prove the following high moment bounds, which are also independently 
useful. 



Lemma A.1. (i) Let (ai) he a family of centred and independent random variables satisfying 

CP 



ap+P 



(A.l) 



for all 2 ^ p ^ (log A^)"^oiogiog^^ where a ^ and /3,7 G R. Then for all even p satisfying 2 p ^ 
(log7V)'4oi°si°s^ we have 



E 



^ {CpY 



M+2a 



1/2- 



(A.2) 



for some constant C > depending only on the constant in (jA.ip . 
(ii) Let oi, . . . , ajv be centred and independent random variables satisfying 

CP 



(A.3) 



for all2 p (logAr)^oiogiogiv^ j,^^^ ^^^^ ^ satisfying 2 p (log AT) log log jv 

Bjj Cz C we have 



E 



maxi^jlBi 



1/2- 



(A.4) 



for some C depending only on the constant in (jA.3p . 
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Proof. We begin with (i). To prove (|A.2[) . we set p = 2r and compute 



E 



2r 



^ ^ • • • Ai^Ai^^-^ ■ ■ ■ Ai^^ Ectij • • • cLi^ai^j^-^ ■ ■ ■ Ui^^ ■ (^-5) 



Each configuration of labels (ii, . . . ,«2r) defines an equivalence relation (or partition) T on the index set 
{!,..., 2r} by requiring that the indices j and A; are in the same equivalence class if and only if their labels 
satisfy ij = ik- We organize the summation over the labels ii, . . . ,i2r by (i) prescribing a partition F of the 
set of indices, (ii) summing over all label configurations yielding the partition F, and (iii) summing over all 
partitions F. Thus, let a partition F be given. Let I denote the number of equivalence classes of F, and 
order the equivalence classes in some arbitrary fashion. Let the size of equivalence class s; clearly, we 
have ri + • • • + r; = 2r. Moreover, since the random variables are centred, we find that each equivalence 
class has size at least 2; in particular, ^ 2 for each s and hence I ^ r. Using the independence of the a^'s, 
we thus find that the contribution of the partition F to (jA.5[) is bounded in absolute value by 



ii ,. . s— 1 



n Ei^^ 



(A.6) 



Abbreviating A := maxi|Ai|, we find that (|A.6[I is bounded by 



Yl (CAq-'^y^^A-^N-'q-^ 



f 1 



\A'^mqp 

max< 1, 



i 

1 

A^mqP 



Ei^^ 



A_ 



Niq' 



Ei^^ 



1/2- 



Next, it is easy to see that the total number of partitions of 2r elements is bounded by (Cr)^*", so that 
we get 



E 



y^^AjUi 



2r 



^ {Crf 



A 



Niqf}+2a E'^^l^) 



1/2- 



2r 



This concludes the proof of (|A.2[) . 

The proof of (|A.4p needs more effort. Without loss of generality, we set Bu — for all i. As above, we 
set p — 2r. Wc find 



2r 



E 



X Ea.i^ai^ 



As above, we associate a partition F(i) = F = {7} of the index set {!,..., 47-} with every label configuration 
i = (ii, . . . ,i4r) by requiring that k and I are in the same equivalence class of F if and only if ik ~ ii- We 
rewrite (jA.7|) by first specifying a partition F and summing over all label configurations i satisfying F(i) — F, 
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and subsequently summing over all partitions F. Note that a partition T yields a nonzero contribution to the 
right-hand side of (|A.7p only if (i) each equivalence class contains at least two indices, and (ii) [2k— 1] ^ [2k] 
for all fc = 1, . . . , 2r; here [n] denotes the equivalence class 7 9 n of 7i in F. 

Next, we encode F using a multigraph (i.e. a graph which may have multiple edges) G = G{T) defined 
as follows. The vertex set of G is the set of equivalence classes {7} of F. Each factor Bi^^ yi^,. or Bi^^ ^i^^ 
gives rise to an edge of G connecting the vertices [2k — 1] and [2k]. Note that, by property (ii) of F, no edge 
of G connects a vertex to itself. Moreover, G has 2r edges. 

Let G be a multigraph with v vertices. We define the value of G through 



^{7,7'}6£(G) 



where 5^ is the degree of 7 in G. 

Fix a partition F. We claim that the contribution to the right-hand side of (|A.7p of all label configurations 
i satisfying F(i) = F is bounded in absolute value by ^{GiT)). This is an easy consequence of the 
definition of G(F): Each vertex 7 carries a label j^, and the contribution of vertex 7 is bounded by E|a,;^ [^'' ^ 
C^i {Nq^''^"^)^^. (Note that, by the property (i) of F, we have ^ 2. Here we also used that ^7 = 4r.) 

Next, we estimate y{G{T)). Let Go := G(F). The idea is to construct a sequence of multigraphs 
Go,Gi,...,Gs by successively removing edges incident to vertices of degree greater than two, until all 
vertices have degree at most two. 

If all vertices of Gg have degree at most two, set s = 0. Otherwise, pick a vertex 7 of Gq with degree 
greater than two, and let 7' be adjacent to 7. Define i?(Go) as the multigraph obtained from Gq by removing 
an edge connecting 7 and 7'. We claim that 

y{Go) < ^r(i?(Go)) (A.9) 

(regardless of the choice of the removed edge). Here we abbreviated Bo := maxi^j|i3y |. The estimate (|A.9[) 
is obtained by estimating [Bi^i^, [ ^ Bo in (jA.Sp . and by noting that [6.y — 2]+ in Go is strictly greater than 
in i?(Go). Now set Gi := i?(Goy 

We continue inductively in this manner, generating a sequence Go, . . . , Gs of multigraphs with the prop- 
erties that Gfe+i = R{Gk) (for an immaterial choice of R), Gs has 2r — s edges, and all vertices of Gs have 
degree at most two. By (jA.9p . we have 

r(Go) < (^^^ riGs). (A.io) 

Next, it is immediate from its definition that Gs is a disjoint union of simple closed and open paths. Here a 
simple open path of length / ^ is the graph with vertices 1, . . . , l + l and edges {1, 2}, ...,{?, ^ + 1}; similarly, 
a simple closed path of length Z ^ 2 is the graph with vertices 1, . . . , / and edges {1, 2}, . . . ,{l — 1,1}, {1, 1}. 

From the definition (jA.Sp we immediately find 

r(GuG') = r(G)r(G'), (a.ii) 

where U denotes disjoint union. We shall now prove that, if G is a simple (open or closed) path of length I, 
we have 
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Using ((XTOt . (TOT]) , and ((XT2|) . we find that 

(2r-s)/2 



9 / V^^ 

Let us now prove (jA.12p . We start with a simple closed path of length Z, whose value (jA.Sp is given by 
Assume first that / = 2fc is even. Then 

\n,---,i2fc / \ii,...,i2fc / 



i/2 



If i = 2A; + 1 is odd we find 



^2fc+l - ]^^fe+T E 1^*1*2 I ( E l-^^2i3l ■ ■ ■ I^i2fc+in I 

^ jV2fc+i [ E E l-Si2i3|---|-B,2fc+i«il|5i2i^|---|^ik-+i»il] 

\'il,'i2 / \n ,...,j2fc+l i3,...,j2i,^i / 

1/2 



This proves (|A.12[) for closed simple paths. Consider now an open simple path of length I, whose value (jA.8|) 
is 

JJT+T E \^ili2\■■■\B^,^,^^\■ 

li I = 2k is even we get 

^ jV2fc + l ( E \^hi2\\Bl3iA- ■ -1^12^-112^] ( E |-^i2i3ll^»4»5l • • ■ |5j2fc»2fc + ll 1 

\zi,...,i2fc + l / \il ,---,'^2fc + l / 



i/2 
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Finally, if ^ = 2/c + 1 is odd we find 



I 



Ei^M2n 



*2fc + l*2fc + 2 I 



1/2 



1/2 



0l/2 < 



IB,;, 



i/2 



l^i2fc + li2fc+2 I 1^1243! 



2fc+l 2fc+2 ' 



This concludes the proof of (|A.12I) . 

Thus we get from (|A.13|) that the contribution to the right-hand side of (jA.7|) of all label configurations 
i satisfying r(i) = F is bounded in absolute value by 



Br. 



1 \ l/2\ ' 



In order to conclude the proof of (jA.4p . we need a combinatorial bound on the number of multigraphs of 
the above type containing 2r edges, as well as on the number of partitions F associated with any given 
multigraph G. Their product is easily seen to be bounded by (Cr)"''". This completes the proof of (|A.4I) . □ 

Proof of Lemma 13.81 The proof is a simple application of Lemma [A. II and Markov's inequality. 

In order to prove (i), we choose p = v{[ogN)^ in (|A.2p and apply a high moment Markov inequality. 
Next, we prove (ii). The bound (|3.19p follows immediately from (i) by setting a = 1, /3 = — 2, and 7 = 1. 



Similarly, the bound p.20p follows easily by applying (i) to the random variables 



erf and setting 



Ai = Bii] here a = 2, /3 = —2, and 7 = 1, as can be easily seen using (|3.18p . Morover, the claim p.2ip 
follows by setting p = v{\og N)^ in \AA\ and applying a high moment Markov inequality. 
Finally, we prove (iii). Write 



E ^i^ij^J 






+ 


^aiB^jbj 






i 







The first term is dealt with by noting that the random variables ai6i, . . . , a^bN are independent and satisfy 
([XTB]) for a = 2, /3 = -4, and 7 = 2. Therefore ([5T7)) yields with i^)-high probability 



a-iBiibi 



^ (logTV)^ 



Bd 
^2 



]^Ei^" 



1/2- 



^ 2(logiV)«^ 



In order to bound the off-diagonal terms, we set Ai := X^j^i ^ij^j- Then we may again apply p.l7p to 
get with (^, i/)-high probability 



I A, I (logA^)« 



1/2' 
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Since Ai is independent of aj, we therefore get from p.l7p 



i^j i 



s; (iogiV)« 



q 

Bo 

q 



1/2- 



1/2- 



with ;/)-high probability. We remark finahy that the constant C may be absorbed into the smah constant 
V when applying the high moment Markov inequality used to prove p.l7p . □ 

Proof of Lemma [6.51 To prove (|6.19p for fc = 1, we estimate with i')-high probability 



(e,i/e) 



Ij]/.,, = o((logiV)«7V-V2)^ 



2J 



where we invoked (|2.5p and applied p.l7p to the 0(7V^) variables \hij ■ i < j} (and similarly for i ^ j) with 
a = 1, /? = —2, and 7 = 1. 

If fc ^ 2 we use a high moment expansion. The following notation will prove helpful. We abbreviate 
a = and write ha '■= hij. Defining 

we may thus write 



{AAA) 



where lE(-) := (•) — E(-). In order to make all matrix entries independent of each other, we split H = H' + H" 
into two triangular matrices, where 

This results in a splitting of (|A.14|) into 2'^ terms, of which we only consider 



ai,...,ak 



(the other terms are dealt with in exactly the same manner and the resulting factor 2'"' is immaterial). 
We abbreviate ex = (ai, . . . , ak) and write 



where we defined 



Ca j^Baia2Ba2a3 ' ' ' Bq^ ^q.^. /l^^ • • • h^^ 
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For even p S N we get therefore 

EXP = e[(c„i -Ec„i)---(c«p -ec„p: 



(A.15) 



,. . . ,ckP 



By independence of the family {ft-J^,}, we find that a summand in (jA.lSP indexed by a vanishes if there is an 
r such that [a.^] n [a."^ ] = for all r' ^ r. Here [(ai, . . . , ak)] '■= {ai, . . . , au}- Thus we find 

EXl = J2 e[(C„i -IEC«0---(C«. -EC„p)]x(a\..-,aP), (A.16) 

where 

Xia\...,an := 1 (3/ : K] n K'] ^ 0) . 

For each given label configuration a = (a'') = («[), we define a partition r(a) of the index set {{r, I) : 
r = 1 , . . . , p, I — 1 , . . . , k} hy imposing that (r, I) and (r', I') are in the same equivalence class of T{a) if and 
only if Uri — olt-'V ■ We now perform the sum over a in (jA.16[) by first specifying a partition F and summing 
over all <y. satisfying F = F(q!), and then summing over all partitions F. Note that any partition F yielding 
a nonzero contribution to (jA.16p satisfies the two following conditions. 

(i) Each equivalence class of F contains at least two elements. 

(ii) For each r = 1, . . . ,p there are r' = 1, . . . ,p and 1,1' = 1, . . . , fc such that (r, I) and (r', I') are in the 
same equivalence class of F. 

Condition (i) follows from the fact that h'^ is centred, and Condition (ii) from the definition of x- 
Let us fix a partition F satisfying (i) and (ii). Its contribution to (|A.16p is 



Ef(C„i -IEC«0---(Co, -IEC«p)lx(a) 



oi:r(oi)=r 



^ Y E[(|C«i|+E|C„i|)---(|Cop|+IE|Cop|)]x(a)- 

(A.17) 

Next, we note that F gives rise to a multigraph G = G{T) defined as follows. The vertex set V{G) is 
given by the equivalence classes of F. Each pair {(r, /), (r, / + 1)}, Z = 1, . . . , fc — 1, gives rise to an edge that 
connects the vertices 7 9 (r, /) and 7' 9 (r, Z + 1). Thus, the set of edges E{G) of G contains p{k — 1) edges. 
The interpretation of the edges is that each factor Ba^ i+i on the right-hand side of (|A.17|) is represented 
with an edge. 

The expectation on the right-hand side of the identity 



E[(|C„i|+E|Cc.i|)---(|Cop|+E|C„p|) 
is bounded by 



1 



■fc-i 



.1=1 



E 



V V 



r=l \l=l 



1=1 



2P n 



7ey(G) 



Cl7l 

Nqh\- 



75 



where I7I denotes the size of the equivalence class 7; this is a simple consequence of (|2.5p and the constraint 
q ^ CN^/"^ . By construction of G, each vertex 7 of G carries a label a^. Thus we may bound (|A.17[) by 



2P_ ^ 



n 



ai,...,a„ L{7,7'}e£;(G) 



n 



J 7eV(G) 



Nq\-y\-^ ' 



where u = |1^(G)| denotes the number of vertices of G. Here we dropped the factor x, and the restriction 
that ai, . . . , be distinct, to obtain an upper bound. By property (i) above, we have I7I — 2 ^ and we 
get the bound 

Next, we split G = Gi U • • • U G( into its connected components; here / denotes the number of connected 
components. An immediate consequence of the property (ii) of F is the bound 



I ^ p/2, 



Thus (jA.lSp becomes 



n 



Br, 



(A.19) 



(A.20) 



where Vj = \V{Gj)\ denotes the number of vertices in Gj. 

In order to estimate the contribution of the j-th connected component, we pick a root rj G V{Gj) and 
a spanning tree Tj of Gj. First, we use the trivial bound Ba^a^, ^ 1 for edges that do not belong to Tj. 
Second, we sum over all of the Vj — 1 non-root labels a-y, starting from the leaves of Tj, and using the identity 



N 



at each step. Third, we sum over the root label 7^.. , which yields a factor bounded by 7V^. Putting everything 
together yields 

ai,...,a„^ {7,7'}eS(Gi) 

Returning to (|A.20|) . we thus find that the right-hand side of (jA.17l) is bounded by 



(jpk 



N' 



v+l 



(Jpk 



where we used C09l) . 

Since the number of partitions F is bounded by {kp)^P , we get the bound 



( {Ckp) 

\ iVl/2 



k\P 



Choosing p = ^^^(log N)^ and applying a high moment Markov inequality completes the proof. 



□ 
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Proof of Lemma 17. 101 The proof is similar to (in fact considerably simpler than) the proof of Lemma 
16.51 We only sketch the argument, using the notation of the proof of Lemma 16.51 without further comment. 
Write 

ii,...,ik ai,...,ak 

where ao := Then, as in the proof of Lemma [6.51 we write as a sum over partitions T which 

give rise to multigraphs G = G{T) whose edges are given by the factors B and whose vertices are given by 
equivalence classes 7 of the set {1, . . . , fc} x {1, . . . to which has been adjoined a distinguished vertex 79. 
The vertex 70 corresponds to the fixed label ag, and it has degree p. Each multigraph G has pk edges, and 
is connected. In this fashion we find that the contribution of the multigraph G to EX^ is bounded by 



E 



n 



,{7,7'}eS(G) 



n El/i^jl-^l, (A.21) 

7eV(G)\{7o} 



where the first sum ranges over families {ct'y)j^v{G)\{-ya} of labels; every vertex 7 7^ 70 carries a label a-y 
which is summed over. The vertex 70 carries the label ao which is fixed. 

Since E/iq, = 0, it is easy to see that I7I ^ 2 for all 7. Choosing a spanning tree of the connected graph 
G, one therefore finds that ()A.21|) is bounded by 



\2\ 



j^\ViG)\-i Yl (max 

7GV(G)\{7o} 

Since the number of partitions F is bounded by {kp)''^, we find EX^ ^ {Ckp)''^ for p ^ (log A^)^. Choosing 
p = 2^(log-/V)^ and applying Markov's inequality completes the proof. □ 

Proof of Lemma 14.31 Our proof is a standard application of the moment method, along the lines of ^T\ . 
Lemma 7.2. 

In a first step, we truncate the entries hij. Let Ci = C be a constant for which Lemma |3 . 71 holds . Define 

Aiy := E/i,,l(|/iy| < Giq'^). 
Choose an independent family (A^ ) of random variables, independent of H, such that 

P(Ay = g-i) = fi,jq , P{X,j =0) = 1 - fi^jQ . 

Now set 

h,j := h,jl{\h,j\ < Cig-i)-Ay. 
It is easy to see that ^ ^ and therefore 

nh,,^h,j) ^ e-^^'^s^)'. (A.22) 

Moreover, we have 



m,, = 0, |%| ^li-tl, E\h,,\' < l(i+e^''('°g^)'). (A.23) 
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By (|A.22|) . it suffices to prove that || s$ 2+ (logiV)?^"^/^ with i/)-high probabihty. We shah prove 
that, for even k ^ Cy/q, we have 

jETrH*^! 3Nk2'' . (A.24) 

In order to prove (|A.24p . we write 

ETvH'^ = Eh,,,, ■ ■ ■h,,_,,,h,,,, (A.25) 

and apply a graphical expansion to the right-hand side. Before giving its precise definition, we outline 
how it arises from (|A.25p . Let the label configuration ii,...,ik be fixed. We represent each index j = 
1, . . . , fc by a vertex [j], whereby two indices j and j' correspond to the same vertex if their labels agree, 
ij = iji . Let p be the number of vertices. We then construct a closed walk through the sequence of edges 
([1], [2]), ([2], [3]), ... , ([fc], [1]). The walk has k steps. We name the p vertices 1, . . . ,p, whereby vertex v is 
reached after all vertices 1, . . . , w — 1. Since Ehij ~ 0, it is easy to see from (jA.25[) that each edge of the walk 
must appear at least twice. 

We may now give a precise dcffirition of such walks. Let w = {wi, . . . ,Wk) be a sequence with Wy e 
{1, . . . ,p}. With w we associate a multigraph G{w) as follows. The vertex set of G(w) is {1, . . . ,p}; the 
edge set of G(w) is given by the undirected edges {wi, W2}, . . . , {wk-i^Wk}, {wk^wi}. (Note that G(w) may 
contain multiple edges as well as loops). We say that w is an ordered closed walk of length k on p vertices if 

(i) A vertex that is visited for the first time at time j is greater than all vertices visited before time j: 
maxj'^j Wji ^ maxj'<j Wj/ + 1. 

(ii) All vertices are visited: {wi, . . . ,Wk} = {1, . . . 

(iii) Every edge of G(w) appears at least twice. 

Let yV(fc,p) denote the set of ordered closed walks of length k on p vertices. The key combinatorial 
estimate of our proof is the bound 

proved in [39] . Using the notion of ordered closed walks, it is not hard to see that (|A.25|1 may be rewritten 
as 

fe/2+1 

ETiH'^ = ^ ^ ^ IE/l£(^^)£(^2) • • • , (A.26) 

P=i wew{k,p}eec{p) 

where C{p) is the set of all p-tuples £ = {£{1) , . . . , i{p)) £ {1,...,N}p whose components are disjoint. 
See [21], Section 7.1, for a detailed proof. 

Next, associate with the multigraph G(w) its skeleton S{w), obtained from G(w) by discarding the mul- 
tiplicity of every edge (i.e. by successively removing edges until it has no multiple edges). For e € E(S(\v)), 
we denote by i^(e) the multiplicity of the edge e in G(w). We have the obvious relation X]ee£;{S(w)) ^(^) ~ k- 
If e = {v,v'}, we write £{e) := {£{v),£{v')) (the chosen order of the pair £{e) is immaterial). Then it is easy 
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to see that 



eeE{S(vr)) ^ 



q 



(1+e- 



-i/(logA')« 



where we used (jA.23[) . and introduced the shorthand Eg |_B(5(w))|. Therefore summing over £ S ^(p) in 
(|X26l) yields 



fe/2+l 

ITrH'^l ^ H 

p=l w6W(fe,p) 



(1 + e- 



-c(logAr)«-| 



-Es /^\ fc 



Next, it immediate that we have the relations p — 1 ^ Es ^ fc/2; these inequalities follow from the above 
properties (ii) and (iii) respectively. Since d'^/N ^ 1, we therefore get 



fc/2+l 



jETri?*^! ^ N J2 |W(fc,p)|(l + e-'=(i°sA')«)pJ £ 



For k ^ N this yields 



where 



fe/2+l 



fc-2p+2 



Sik,p) 



k 

^2p - 2 

It is elementary to check that S{k, k/2 + 1) = 2'' and 



p2(fe-2p+2)22p-2/ £ 

q 



fc-2p+2 



S{k,p) < 



fc4 /c 



8 \q 



S{k,p + 1). 



Therefore choosing k ^ Cy/q implies S{k,p) ^ 2*^. This concludes the proof of (jA.24[) . 

The claim now follows by setting k ~ c^/q with a sufficiently small constant c, applying a high moment 
Markov inequality, and recalling that y/q ^ (logiV)^ by ()2.6p . □ 
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